FIGURE l 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT TAC TGCAG TAACAC T CCACCATATAGACC CGGC TT TACCT TAT ATCAGTGACAC T GG 
TACAGTAGC TCCAGAAAAAT GCT TAT T TGGGGCAATGCTAAA.TAT TGCGGCAGT T T TAT GCAT TG 
CTACCAT T TATG T TCGT TATAAGCAAGTT CATGC TCT GAGTCCTGAAGAGAACGT TATCAT CAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
GAGATATTTGATGAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAA.TTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GATGAAT GCT GATAAT CAGGAAACAT GAAAGAAGCC AT T T GAT AGAT TAT T C T AAAGGAT AT CAT 
CAAGAAGAC TAT TAAAAACACC TATGCC T AT AC T T T T TTAT C T CAGAAAAT AAAGTC AAAAGAC T 
ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX{S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LC I AT I YVRYKQVHALS PEENVI IKLNKAGLVLG I LS CLGLS I VANFQKT TLFAAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPSDKGYVLHMITTAAEWSMSFS FFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features : 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE 3 



W 
U1 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
P CATT CTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGT GATTGGAG 

Q TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 

CTCAGGAATGTGACT GACACGGATATTCTAGCCCTGGA&CGGCGACT GCTGCAAACCATGGATATGAT 
CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
:K g CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
3 ATAT GCTACCAAGGAGAGAATAGAATACT CCAAAAC CTTCAAGGGGAAATAT T TTAATT TTCTTGGTT 

O ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
Y) GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
3^ GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 
J? TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
Fiji CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 

r. 

AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACAT T TTATAAACAAACAAAATGC TATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCAT CCCGTGTGGATAT GAGGCTGG 
T GTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACT CTGGGGCAAGACAT GT 
CTAT GGTAGCTGAGCCAAACACGTAGGAT TTCCGTT TTAAGGT TCACAT GGAAAAGGTTAT AGCT TTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 4 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNS S SRYFHWKMNLCVILL I LVFMVP FY I GYFI VSNIRLLHKQRLLFS CLLWLT FMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMMjLSGFGAWCPYTYMSYFLRWTDTDIIJUjERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFiyiATINIVFDRVGKTDPVTRGIEI 
TWYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS SVLL I RMSMPLE YRT 1 1 TEVLGELQFNFYHRWFDVI FLVS ALS SI L FL YLAHKQAPEKQMAP 

y Important features: 

Signal peptide: 

j",2 amino acids 1-23 

V 2 . 

Potential transmembrane domains : 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

13 N-glycosylation sites. 

Ml amino acids 67-70, 180-183 and 243-246 

z 

Eukaryotic cobal ami n -binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
T TGAAAG C T AGC T T T GAAAC T T GCAGC TAT GGCTGGGTTG GAGAT GGAT T C GTG G TCAT C T C TAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
ATTAT CACCACCAAAGAT CCCATATTCAACAC T CAAAC T GCAACACAAACAACAGAAT T TAT TGT 
tj CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
C T GC TCCAGCT TCCACT T CTAT T CCAC GGAGAAAAAAAT T GAT T T GT GT CACAGAAGT T T T T ATG 
O GAAACTAGCACCATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGA 
O AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTGCTCTTCTTTGGTGCTG 
HI CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
.? I CAGCAGAAGGAAAT GAT C GAAAC CAAAGT AGTAAAGGAGGAGAAGGCCAAT GATAGCAACCC T AA 

jj! T GAGGAATCAAAGAAAACT GATAAAAACCCAGAAGAGTCCAAGAGTC CAAGCAAAAC TAC CGT GC 

yi GATGCCT GGAAGC T GAAGT T TAGAT GAGACAGAAATGAGGAGACACACC T GAGGC TGGT T TCT T T 

%0 CAT G C T C C T TAC C C T G C C C C AG C T G G G G AAAT C AAAAG G G C C AAAGAAC C AAAG AAG AAAG T C C A 

C CCCT TGG TT CC TAAC TGGAATCAGCTCAGGAC T GCCAT TGGACTATGGAGT GCACCAAAGAGAAT 

7" GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
* s% T T C TAGCCT GGCTAT GTCC TAATAATATC CCAC TGGGAGAAAGGAGT T T T GCAAAGTGCAAGGAC 

y CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
Iff AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
r| GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 

GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
J! GCTAAAATAAAGAAATAGAACAAGGC T GAGGATACGACAG TACAC TGT CAGCAGGGAC T GTAAAC 

If; ACAGACAGGGTCAAAGTGT T T T C TC TGAACACAT TGAG T TGGAAT CAC TGT T TAGAACACACACA 

f|| CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 

AAAAT AAAAAC TCT TAT AAAT T T CTAT T T T TAT C T GAG T TACAGAAAT GAT TAC TAAGGAAGAT T 
ACTCAGTAAT T T GT T TAAAAAGTAATAAAAT T CAACAAACATT TGCTGAATAGC TAC TAT AT GTC 
AAGTGCT GT GCAAGGTAT TACAC TC T G TAAT TGAATATT AT TCC TCAAAAAATT GCACAT AG TAG 
AACGCT AT CT GGGAAGCT AT T T T T T T CAG T T T TGATATT T C TAGCT TATC TACT TCCAAAC T AAT 
T T TTAT T T T T GC TGAGAC TAAT C T TAT TCATT T TC T C TAATATGGCAACCAT TATAACCT TAAT T 
TATTAT TAACATACC T AAGAAGTACAT T GT TACC TC TATATACCAAAGCACATT T TAAAAGT GCC 
AT TAACAAAT GTATCACT AGCCC TCCT T T T TCCAACAAGAAGGGAC TGAGAGATGCAGAAATAT T 
TGTGACAAAAAATTAAAGCATT TAGAAAACTT 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKAS FET CS YGWVGDGFWT SRI S PNPKCGKNGVGVL I WKVPVSRQFAAYC YN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I CVTE VFME TS TMS TETEP FVENKAAFKNEAAG FGGVPTALLVLALLFFGAAAGLG FCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site . 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT TGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCT GCTA 
AAGCAT CATCAGAAGT GAACCT GGCAAACT TACCTCCCAGCTAT CACAAT GAGACCAACACAGAC 
ACGAAGGT TGGAAATAATACCATCCATGTGCACCGAGAAAT TCACAAGATAACCAACAACCAGAC 
T GGACAAATGGT CT T T TCAGAGACAGT TAT CACATCT GT GGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
|4 ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
r| GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
f\ ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
|f? ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
f\ CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
;J; CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
*M GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
Ul CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
& CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
% TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
fj\ CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
ifl CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
%l GGAGCAG T T T GCCACC CCT GTCCAGAT TAT T GGC TGC TT TGCC TC TACCAG T TGGCAGACAGCCG 

Tt TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 

TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
O CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
I'll AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 

TACC T CT GT GCCAGGGCAGCAT T T T CATAT CCAAGAT CAAT TCCCTCTCT CAGCACAGCCT GGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAG TGAAGACCAGAGCAGT T T CAT CT GGTT GT GACTCTAAGC TCAGT GC T C T 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
AT TGACAC TGTC CC TCTT TGGCAGT TGCAT TAGTAACT T T GAAAGGTATAT GAC TGAGCGTAGCA 
TACAGGT TAACC TGCAGAAACAGTAC T T AGG TAAT T GT AGGGCGAGGATTATAAATGAAATT TGC 
AAAATCAC T TAGCAGCAACT GAAGACAAT TAT CAACCACGT GGAGAAAAT CAAA.CCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGAT T GTATAAGCATGCT T T C T T T GAGT TT TAAAT TAT GT ATAAACATAAGT T GCAT T TAGAA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 



APPJD= 



10063594 



Page 159 of 320 



FIGURE 8 



MQRLGATLLCLLIJ\AAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSE WLANLPPSYHNE TNTDTKVGNNT IHVHRE IHKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFAS FQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGE PAAAAAALLGGEE I 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site, 

amino acids 202-208, 217-223 

Amidation site. 

amino acids. 140-144 
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FIGURE Q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATAT T GACCACAAGCT CAGAGGAGT GCAGAACCCT GTAGC CCGCTGCATCATGT GCT GT TT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGAGACGCTCTTCCTCTGCTTCCT 
GGAAGACC T GGAGCGGAACAACGGCTCCCT GGACCGGCCC TAC TACAT G T CCAAGAGCCT T C TAA 
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAAGAAGAAGAGGAAGAAGTGACAGCTCCGG 
Iff CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
O CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
W TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 

AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGATT TT AT TAAAGAT AT TT TG T TAACT C 
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FIGURE 10 



RTRGRTRGGCEKVP INTS CNPTAHLVNS SCPGLMCVFQGYS SKGL I QRSVFNLQI YGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQD I PT FPL I SAFIRTLRYHTGSLAFGAL I LTLVQIARVIL 
E Y IDHKLRGVQNPVARC IMCC FKCCLWCLEKFI KFLNRNAY IMIAI YGKNFCVSAKNAFMLLMRN 
I VRVWLDKVTDLLL FFGKLLWGGVGVLS FFFFSGRI PGLGKDFKS PHLNYYWL PIMTSI LGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
i-i: Transmembrane domains: 

CJ amino acids 57-80 (type II), 110-126, 215-231, 254-274 

%*$ 

?fh 

f'~ N~glycosylation sites. 

If! amino acids 16-20, 27-31, 289-293 

%u 

Hypothetical YBR002c family proteins. 

JL amino acids 276-288 

f Ammonium transporters proteins . 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCC ATG GGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCGCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AAC AGCC T GAT G C AGAC CGAG GAG T GC CC ACC TAT G C TAGAC G C C ACAC AG CAGC AG CAG C AGC AGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGC TGA GGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCESGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLI LVGLT VGAFY I PDGS FTNIW FYFGWGS FLFI L I QLVLLI DFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSS I PEQKCNPHLPTQLGNETWAGPEGYETQWWDAPS I 
VGL 1 1 FLLCTL FI S LRS S DHRQVNS LMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVT YS Y 
SFFHFCLVIJ^LHVMMTLTNWYKPGETRKMISTWTAVWWICASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGAC TCC T GTAACCC TCC T CCAGGATGAACCACC TGCCAGAAGACAT GGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
C TCATGGCCAGGAT TGAGTCC TATGAAGGAAGGGAAAAGAAAGGCATATCT GATGTCAGGAGGAC 
T TTC TGT T T GT T TGT CACCT T TGACC T CT TAT T GGTAACAT T AC TGT GGAT AATAGAGT TAAATG 
T GAAT GGAGGCATT GAGAACACAT TAGAGAAGGAGGTGAT GCAGTATGAC TAC TATT CT TCAT AT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CT TGCCTGGAT T GAGACGT GGT TCCT GGAT T T CAAAG TGT T ACC TCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
G TCAGT T T TAT TCCCCT CC TGAATCCGAAGCAGGATC TGAAGAAGC T GAAGAAAAACAGGACAGT 
GAGAAACCACTTTTAGAACT ATGAG TACTACTTTTGTTA7VATGTG7LAAAACCCTCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACTGCTGGCT T TATTGAACAGCTAATAAAGATT TATTTATTGTAATACCT CACAAACGTTGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
T CAGT GAGACTGAGCCTGATGTGT TAACAAATAGGT GAAGAAAGTC T T GT GCTGTAT TCCTAATC 
AAAAGAC T T AATATAT T GAAG TAACAC T T T T T TAG T AAG C AAGATAC C T T T T TAT T TC AAT T CAC 
AGAAT GGAATTTTTTTGT T T CATGT C TCAGAT T TAT T TTGTAT T TC TT TT T T AACACTCTACAT T 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T TGCAAAAGGGGAAGAAAGGAAT TGCGAATACATGTAAAATG TCACCAGACAT T TGTAT TATT T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACT TAAACCATTCATATCATGT TTCCTTTGCGT TCAGCCAAT TTCAATTAAAATGAA 
CTAAATTAAAAA 
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FIGURE 1A 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWI IELNVNGG IENTLEKEVMQYDYYSS YFD I FLLAVFRFKVL I LAYAVCRLRHWWAI ALT T 
AVTSAFLIJ^ILSKLFSQGAFGWLPIISFIIAWIETWFLDFKVLPQEMEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 



APP_ID=1 0063594 



Page 166 



FIGURE is 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATQTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
1^ CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
pi GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
;~J TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
Cl CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
Iff TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
h£ TCTTCCCCCGCC.TGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TZ TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 

CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
«g TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
8 ' CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
j^l ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
J ! !f GGCCT G TAC TGTGAGAGCCAGAT GGGGCAGGGGACACGGCCCAGC CC TACACCAGTCACGCCGAG 

l[l GCCACCACGGTCGCTGACCCTGGGGATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
Q AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
jij GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
S GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
J;!; GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
W CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTGAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
T CCAGGCGGACC C TGGGGGCCAG TGAAGGAAGC T CCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCT GT GT GACT C TAGTCT T GGCCCCAGGAAGCGAAGGAACAAAAGAAACT GGAAAGGAAGAT GC 
T T TAGGAACATGT T TTGCTTTTT TAAAATATATATAT T TATAAGAGAT CCT T TCCCAT T TAT T CT 
GGGAAGAT GT T T T TCAAACT CAGAGACAAGGAC TTTGGTTTT TGTAAGACAAACGATGATAT GAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 



APP ID= 



10063594 



Page 167 of 320 



FIGURE 16 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGS FAGLPGLQLLDLSQNQI AS L PS GVFQPLANLSNLDLTANRLHE I TNE TFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLI^ 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
|~; ;! VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASIiAEY 
13 TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 

LLAALAAVGAAYCWRGRAMAAAAQDKGQVGPGAGP 

tf% 

"is? •' 

f 1 CEVPLMGFPGPGLQSPLHAKPYI 

Iff 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTC ATOC G 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGAT GAAT CC TT AGAT T CCAAGACTACT T TGACAT CAGATGAGTCAGTA 
AAGGAC CAT AC T AC T G C AG G CAGAG TAG TTGCTGGT C AAAT AT T T C T T GAT T CAG AAG AAT C T G A 
ATTAGAAT CCT C TAT T CAAGAAGAGGAAGACAGCC TCAAGAGCCAAGAGGGGGAAAGT GTCACAG 
AAGATAT CAGC T TT C T AGAGTCTCCAAAT CCAGAAAACAAGGAC TAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
H T T T CCTAGATAAGGAG TAT GAT GAAT GT AC AT C AGAT GGGAGGGAAGATGGCAGACTGTGGT GTG 

CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
J:f AAGAGACGGCAGATGCAGGAAGCAGAAAT GATGTAT CAAACT GGAATGAAAAT CC T TAAT GGAAG 

7tf CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTAT CT C CAAAAGGCAGCAAGCATGAACCATA 

If! CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 

"4'? v, 

\U GCAGCGAGAGAGAT GT T T GAGAAGC T GACT GAGGAAGGC T C TCCCAAGGGACAGACTGCTCT TGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 

% TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 

m AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTTTCAGCTTTCATGATC 

Q CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 

W AACACAT GG7\ATC TACATGTAAATGAAAGT T GGTGGAGT CCACAAT T T TT CT TTAAAAT GAT TAG 

M TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 

f II 

GAAT T G T C AGAAT CAT T T T T T AC AT T AGAT TAT CAT AAT T T T AAAAAT T T T T C T T TAG T T T T T CA 
AAAT T T T G T AAAT G G T G G C T AT AG AAAAAC AAC AT G AAAT AT TAT AC AAT AT T T T GCAAC AAT GC 
CCTAAGAAT TGT TAAAAT T CATGGAG T TAT T TGTGCAGAAT GAC TCCAGAGAGC TCTACT T T CTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAAC TCAT T T T TAATAAAAT TAT GT CTAAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAW^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRYRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALVYYT FGALGGNLI AHMVLVSRL 



-An 



Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 



111 N-myristoylation sites. 

Q amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

fVi 

s ^ amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 



Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AATTCAGAT TT TAAGCCCATTC TGCAGTGGAATT TCATGAACTAGCAAGAGGACACCATCTTCT T 
GTAT TATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCT TT TGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACAT T T TTATCACTGGATGTGACT CGGGCT T T GGAAAC TTGGCAGCCAGAAC T T T T GAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACC TCAGAGAGACT TCGTACT GTGC T T CT GGAT GT GACCGACCCAGAGAAT GT CAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
CC TGT TTGGAC TCATCAG T GTGACAC TAAATATGC T T CCT T T GGTCAAGAAAGCT CAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGT GGAAGGT T TCAATGACAGC T TAAGACGGGACAT GAAAGC T T TT GGTGT GCACG T C TC 
I fl ATGCAT TGAACCAGGATTGT TCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 

%U TC GCCATT TGGGAGCAGC T G TC TCCAGACAT CAAACAACAATATGGAGAAGGT TACATT GAAAAA 

f a 

*?* AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
^ CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 

T T T TC T GGAT ACC TC TGT C T CACAT GCC AG CAGC T T T GCAAGAC T TT T T AT TGT TGAAACAGAAA 
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCT TATCTAAATAT TAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 



If! 
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FIGURE 20 



ML FWVLGLL I LCGFLWTRKGKLK IEDI TDKYI FI TGCDSGFGNLAART FDKKGFHVIAACLTES G 
STALKAETSERLRTVLLDVTDPENWRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVRVSCIEPGLFKTNLADPVKVIEKKM 

LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKMIANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites . 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 



APP_ID=10063594 



Page 172 of 320 



FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GG T GAAG C C AAG AAC AG CAT T AC T GAT T C C C AAAT G GAT GAT G T T GAAG T T G T T T AT AC AAT T G A 
CAT TCAGAAATATAT TCCATGC TATCAGCT T T T TAGCT T T TATAAT T CT T CAGGCGAAGTAAATG 
AGCAAGCACT GAAGAAAATAT TATCAAATGT CAAAAAGAAT GT GGTAGGT T GGTACAAAT TCCGT 
CGTCAT T CAGATCAGAT CATGACGT T TAGAGAGAGGC TGCT TCACAAAAACT T GCAGGAGCATT T 
TTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAATAACAGAAAGCTGCTCTACTC 
jM A T C G AC T G G AACAT T C C T T AT AT AAAC C T C AAAAAG G AC T T T T T C AC AGG G T AC C T T TAG T G G T T 

CI GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
% f T TT TAGC CGAGCAGTACAAACACACAGC T CT AAAT T T T T T GAAGAAGATGGATCCT TAAAGGAGG 

T I TACATAAGATAAAT GAAATGTATGC T TCAT T ACAAGAGGAAT TAAAGAGT ATAT GCAAAAAAGTG 

tfl GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 
V3 GAAAAGGAGAGGAGCACAGAT T CAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 

ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
!^ ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCAC 
J AAAT CAT T AAGCATAAAGCCT TAGAC TT AGAT GACAGATGGCAAT T CAAGAGATC TCGGT T GT TA 

y GATACACAAGACAAACGAT CTAAAGCAAATAC TGG TAGTAGT AACCAAGAT AAAGCATCCAAAAT 

3 GAGCAGCC CAGAAACAGAT GAAGAAATT GAAAAGAT GAAGGGT T T T GGTGAATAT TCACGGT CTC 

CTACAT TT TGAT CC T T TTAACC T TACAAGGAGAT TT T T T TAT T TGGC T GAT GGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
AC AC AT T C T T TAG AAT T G G AAAAG T GAG AC C AG GCAC AG T G G C T C AC AC C T G T AAT C C C AG C AC T 
TAGGGAAGACAAGTCAGGAGGAT TGAT T GAAGC TAGGAGT TAGAGAC CAGCCT GGGCAACGTAT T 
GAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTAT T TTCAAAATAT GGAAA 
GAAATTTATATGAAAATTTATCTGAGTCAT TAAAAT TCTCCTTAAGTGATACT TT TT TAGAAGTA 
CATTAT GGCTAGAGT T GCCAGATAAAAT GC TGGATATCAT GCAATAAAT T T GCAAAACAT CATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQL FS FYNS S GE VNEQALKK I LSNVKKNWGWYKFRRHS DQ IMT FRERLLHKNLQEH FSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWMTLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
Q I QAAREKNI QKDPQENI FLCQALRT FFPNSEFLHS CVMS LKNRHVSKSS CNYNHHLDWDNLTL 
MVEHTDI PEAS PAS T PQI I KHKALDLDDRWQFKRSRLLDTQDKRSKANTGS SNQDKASKMS S PET 
DEE I EKMKG FGE YS RS PT F 

Important features: 
Signal peptide: 

amino acids 1-19 



N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family, 
amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
H CAAGAGATGGAGGAGAAGTACAGCCT GCAAAGTAAAGAT GAT TTCAAAAGT GT G GTCAGCGAACAGT GCAAT CATTT G 

%%%> CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
Q AATGATAT GTTTGT GAAGACATATGGC CATTTATACAT GCAAAAT T CTGAGCTATTTAAAGAT CT CTT C GTAGAGTTG 

y» AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 



U! 



TTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
4* AT GAT CT ACT GCT CCCACTGCCGGGGTCT CGTGACTGT GAAG CCAT GTTACAACTACT GCT C AAACATCAT GAGAGGC 

« TGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 
CJ GAGGGT CCTTT CAACATT GAAT CG GT CAT GGAT CCCAT C GAT GTGAAGATTTCT GATGCTATTAT GAACAT GCAGGAT 

41! AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
^1 GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 
III CT GTTT GCAGT GACAGGAAAT GGATTAGC CAACCAGG GCAACAACC CAGAGGT C CAGGTTGACAC CAGCAAACCAGAC 

ATACTGATCCTTC GTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGATGAAGAATGCATACAAT GGGAACGACGTG 
GACTTCTTTGATATCAGTGATGAAAGTAGTGGAGAAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAG 
TTTGACTACAAT GCCACT GACCAT GCTG GGAAGAGTGCCAAT GAGAAAGC CGACAGTGCTG GT GTC C GT CCTGGGGCA 
CAGGCCTACCT C CT CACT GT CTTCT GCAT C TT GT T CCT GGTTATGCAGAGAGAGTGGAGATAATT CT CAAACT CT GAG 
AAAAAGTGTT CAT CAAAAAGTTAAAAGGC ACC AGT TAT CACT TTT CT ACCATCCT AGT GACTTTGCTTTTTAAAT GAA 
TGGAGAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAA 

AGGAAAAGGGACTGTGCATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TT TTT TTC CAACT GT GAT CT CGCCTT GTTTCTT AC AAGCAAACCAGGGT CCCT T CTTGGCAC GT AACAT GTACGTATT 
T C TGAAATATTAAATAGCTGTACAGAAGCAGGT TTTATTTAT CAT GTTAT CTTATTAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 24 



*W Signal peptide: 



MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCS QEMEEKYS LQSKDDFKS WSEQCNHLQAVFAS RYKKFDE FFKELLENAEKSLNDMFVKT YGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAART FAQGLAVAGDWSKVSVVNPTAQCTHSJjLPCMI YCSHCRGL 
VTVKPCYNYCSNIMRGCIANQGDLDFEWNNFIDAMI^AERLEGPFNIESVMDPIDWISDAI^ 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KS ANEKADS AGVRPGAQAYLLTVFC I L FLVMQREWR 

Important features: 



amino acids 1-22 



0 ATP/GTP-binding site motif A (P-loop) 

Ul amino acids 515-524 



N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 



N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 2* 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTGTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGAT GGC TCCAGGAAGGCGGCCAAGAAT G T GAGTGCAAAGAT TGGT T CCT GAG 
AGCCCCGAGAAGAAAATTCATGAGAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
T CAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCAT TCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTC T TCCAAT TAAACAT T CT CAGCCAAGAAGACAGT GAGCACACCTACCAGACAC TC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGC TAGT GT CATT TAACCT TAAATGCAAT CAGGAAAGTAGCAAACAGAAGT CAATAAAT AT TT TT 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPmLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
1% CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
fp TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
kj GAGCCCAC CTGGAAACACAT TGGGGAT GGC TGC TGCC TCACCAGAGAGACC TGGAAGGAT CT TGA 

HI GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
^ GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 

T TCCCCAGCC T CCAAT TAGAACAAGCCACCCAC CAGC CTAT CTAT CT T C CACTGAGAGGGACC TA 
Q GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
l|l AATCTCTAACTTGAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
tl AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 

CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 

O 

yes CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 

CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GCCAGAGCAAGAC TCAAAGAGGCAGAGGT T T T GT T CT CAAATATT T T T TAATAAATAGAC GAA 
ACCACG 
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FIGURE 28 



MDI LVPLLQLLVLLLTLPLHLMALLGCWQPLCKS YFPYLMAVLT PKSNRKME S KKREL FSQI KGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFVVAPGEDM 
RQLADGSMDVWCTLVLCS VQS PRKVLQEVRRVLRPGGVL FFWEHVAE PYGSWAFMWQQVFE PTW 

KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTTTATGATTCTTATACTAATTTATACAAAGATATTAAGGCCCTGTTCATTAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 31 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTT TAT GGT GGC TGTAT TAT GAC TAT ACCAACGACC TCAGC ATAGAAT T GGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
C TGCTCGTC TTGATT T T TGTTC TCAGAAAGAGAATAAAAT T GACAGT TGAGC T T T TCCAAAT CAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GATTCCGAGAATCAT TGT CATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCAT TGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATGCATATACT ACAAC TGC TAT TAAT GGGACAGAT T TC TG T ACAT CAGCAAAAGATGC 
ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCAT7\A 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAG GAG CAACAAAT T AAACAAT GCAAG G GCAC AG C AG GACAAGCAC T CAT T AAGGAAT GAG GA 
GGGAACAGAAC TCCAGGCCATTGTGAGATAGATACCCATT TAGGTATCTGTACCTGGAAAACATT 
TCC T T CTAAGAGCCAT T TACAGAATAGAAGATGAGACCAC TAGAGAAAAGT TAGTGAAT x T T TT T 
T TAAAAGACCTAATAAACCCTAT TC T T CCT CAAAA 
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FIGURE 32 

MS GRDT I LGLC I LALALSLAMMFT FRFI T TLLVH I FI SLVI LGLLFVCGVLWWL YYDYTNDLS I E 
LDTEREMMKCVLGFAI VSTG I TAVLLVL I FVLRKRI KLT VELFQI TNKAI S SAPFLLFQPLWT FA 
ILIFFWLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
1^ VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Csl Important features : 

fl Signal peptide: 

amino acids 1-20 

m 

«K Putative transmembrane domains: 

* w amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

W 

f*l N-glycosyiation sites* 

|J amino acids 204-208, 295-299, 313-317 

S3 

*W N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE aa 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAAT TT T T T GAAAAAAAAAT T GCC T T CT T CAAACAAGGG TGT CAT TCTGATAT T TATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT TCAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCCCAAGTT CACT GTGC CTCAGAT CAAC T GC 
GATGT CAAAGC CGGAAAGAT CAT CGAT CCT GAGT T CAT TGT GAAATGT CCAGCAGGAT GCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGT TAT T CCAACGG T GT CCAAT CGT TAT C CCTACCACGAT GGAGAGAATC CTT 
TATCGT CT T AGAAAGTAAACCCAAAAAGGGT GTAACC TACCCAT CAGCTCT TACATACT CAT CAT 
H CGAAAAGT CCAGC TGCCCAAGCAGGT GAGACCACAAAAGCCTAT CAGAGGC CACC TAT T CCAGGG 

Q ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
f*| CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
% GCCACAGGAGCCAGGAGAT GGAT C TC T GGT CCAC TGCCACC TACACAAGCAGCCAAAACAGGCCC 

fl AGAGCT GAT CCAGGTATC CAAAGGCAAGAT CC T TCAGGAGC TGCC TTCCAGAAACC TGT TGGAGC 

W G GATGT CAGCC TGGGACTT GTT CCAAAAGAAGAATTGAGCACACAGT C T T T GGAGCCAGTAT CCC 

|f| TGGGAGAT CCAAAC TGCAAAAT T GAC T TG T CGT T TT TAAT T GATGGGAGCACCAGCATT GGCAAA 

■Jj CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
~% CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
*^ ACACGAAT T CTCGAGAT C T GAAGACAGCCATAGAGAAAAT TAC TCAGAGAGGAGGAC TT T CTAAT 

* GTAGGT CGGGCCATC TCCT T TGT GACCAAGAAC T TCT T T TC CAAAGC CAAT GGAAACAGAAGCGG 

Q GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
'ill GACT TGCGAGAGAGTCAGGAATCAACATTTTCT TCAT CACCAT TGAAGGTGCTGCTGAAAATGAG 

fi AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 

GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
O GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
f|| AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 

TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCC CAACAAGAGGAAG T TAAT GAT CCTCAT CACCGACGGGAGG T CCTACGACGACGT CC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GT TTGACAACC TCCAT CAGTATGTCCCCAGGATCATCCAGAACAT T TGTACAGAGT TCAACT CAC 
AGCCTCGGAACTGAATTCAGAGCAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAAT GT CT T GT TATTAT T C TT TGCCAT CAT GC T T T T T CATAT TC CAAAAC T T GGAGT TACAAAGA 
T GATCACAAACGTATAGAATGAGCCAAAAGGCTACAT CAT GT TGAGGG TGCT GGAGATT T TACAT 
T T TGACAAT T G T T T T CAAAAT AAAT G T T C G GAATACAG T G C AG C C C T TAC GAC AGG C T T AC GT AG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T TGTCAT GACAAT GT AGGAAT TGCT GAAT TAAATGT TT AGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAA?U\AAAAAAM 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?U\AAAG 



APP_ID=1 0063594 



Page 185 of 320 



FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
E S FI VLE SKPKKGVTYPSALTYS SSKS PAAQAGET TKAYQRPP I PGT TAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPV.SLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLL7VDVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNVVVVMVDGWPTDKVEEAS RLARE S GINI FF I T IEGAAENEKQYWE PNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI L I TDGRS YDDVRI PAMAAHLKGVT TYAI GVAWAAQEELEVIATHPARDHS FF 
VDEFDNLHQYVPRIIQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation si te . 

amino acids 304-308 
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FIGURE 



CCGAGC ACAGGAGAT T GC CTGC GT TT AGGAGGT G GC TGCGTT GT GGGAAAAGCT AT CAAGGAAGAAAT TGC 
C AAACC ATGT CT T TT T TT CTGT TT TCAGAGT AGT TC AC AAC AGATCT GAGTGTT TT AATT AAGCAT GGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GG AT GT CACT GAGAT C CCTC AAAT GGAGCC TCCT GCTGC T GT C ACT CC TGAGTT TCT TTGTGATGT GGT AC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
C AGACAAGACTT TC ACTT CAC ACT TCGAGAGCAT TC AAACTGCTCT C ATCAAAATC C AT T T C TGGT CATT C 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
Jtf TGGT GGGGAT AT GAGGTT CTT ACATTTT TC TT AT T AGG CC AAGAGGCT GAAAAGGAAGAC AAAATGTT GGC 

T I AT T GTCCTT AGAGGAT GAAC AC CT TCTT T ATGGT GAC ATAAT CCGAC AAGAT TT TT T AGACACAT ATAAT A 

ill 

|f J ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 

AAGACAGAC ACT GAT GTT TTCATC AAT ACT GGC AAT TT AGT G AAGT AT CT T T TAAACCT AAACC AC TCAGA 

J* gaagtttttcacaggttatcctctaattgataattattcctatagaggattttaccaaa?^ 
3 cttaccaggagtatcctttcaaggtgttccctccatactgcagtgggttgggttatataatgtccagagat 
O ttggtgccaaggatctatgaaatgatgggtcacgtaaaacccatcaagtttgaagatgtttatgtcgggat 

ctgtttgaatttattaaaagtgaacattcatattccagaagacacaaatcttttctttctatatagaatcc 
atttggatgtctgtcaactgagacgtgtgattgcagcccatggcttttcttccaaggagatcatcactttt 
tggcaggtcatgctaaggaacaccacatgccattattaacttcacattctacaaaaagcctagaaggacag 
gataccttgtggaaagtgttaaataaagtaggtactgtggaaaattcatggggaggtcagtgtgctggctt 
acactgaactgaaactcatgaaaaacccagactggagactggagggttacacttgtgatttattagtcagg 
cccttcaaagatgatatgtggaggaattaaatataaaggaattggaggtttttgctaaagaaattaatagg 
accaaacaatttggacatgtcattctgtagactagaatttcttaaaagggtgttactgagttataagctca 
ctaggctgtaaaaacaaaacaatgtagagttttatttattgaacaatgtagtcacttgaaggttttgtgta 
tatcttatgtggattaccaatttaaaaatatatgtagttctgtgtcaaaaaacttcttcactgaagttata 
ctgaacaaaattttacctgtttttggtcatttataaagtacttcaagatgttgcagtatttcacagttatt 
attatttaaaattacttcaactttgtgtttttaaatgttttgacgatttcaatacaagataaaaaggatag 
tgaatcattctttacatgcaaacattttccagttacttaactgatcagtttattattgatacatcactcca 
ttaatgtaaagtcataggtcattattgcatatcagtaatctcttggactttgttaaatattttactgtggt 
aatatagagaagaattaaagcaagaaaatctgaaaa 



w 

3S 
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FIGURE 16 



MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
L S LEDEHLLYGD 1 1 RQDFLDT YNNLTLKT IMAFRWVTE FCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP I K FEDVYVG I C LNLLKVN I H I P E D TNL FFL YR I HL DVC QLRRV I AAHG FS S KE 1 1 T FWQVML R 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 



tjl N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 



Glycosaminoglycan attachment site. 



III amino acids 239-243 



Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGGAAG GATGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGCCCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AATATGATCAGAT TGAGTGCGTC TGCC C C GGAAAGAGGGAAGTCGTGGGTTATAC CATC CC T T GCTGC AGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCT TGGATGACTTC TATGT GAAGGGGT TCTAC TG TGCAGAGTGCCGAGCAGGCT GGTACGGAGGAGAC TGCATGC GATGTG 
GCCAGGT TC TGC GAGCC CC AAAGGGTCAGAT T T TG T TGGAAAGC TAT C C CCTAAAT GCTCAC TGTGAATGGACCAT TCAT GC 
TAAACCTGGGT T T GTCATCC AACTAAGAT TT GT CATGT TGAGTC TGGAGT T T GACTACATGTGCCAGTATGAC TATGT TGAG 
GTTCGTGATGGAGACAACCGC GATGGC C AGAT C ATCAAGCGTGTC TGT GGCAACGAGCGGC CAGC TCC T ATCCAGAGCATAG 
jy K GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
|T*| ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
f"| TATAC TGGGCAGC GC TGTGAAAATC TCC T TGAAGAAAGAAACTGC TCAGACCCTGGGGGCCCAGTCAATGGGTAC C AGAAAA 

ff| TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
% s jj T CT TAGTGGCAATGAGAAAAGAAC T TGCC AGCAGAATGGAGAGTGGTCAGGGAAACAGCC CATC TGCATAAAAGC CTGC C GA 

H?;; GAAC CAAAGAT T TCAGACC T GGTGAGAAGGAGAGT T CT TCCGATGCAGGT T CAGT C AAGGGAGAC ACCATTACACC AGCTAT 

. ?v. ACTC AGCGGCC T TCAGCAAGC AGAAAC TGCAGAGT GCCCC TACC AAGAAGCCAGC CCTTCCCTT TGGAGAT CTGCC CATGGG 

??: ATAC C AACATC TGCATACC CAGC TCCAGT ATGAGT GCATC TCACCCT TC TAC C GCC GCC TGGGCAGC AGCAGGAGGACATGT 

C TGAGGAC TGGGAAGT GGAGTGGGCGGGC AC C ATC C T GCAT CCC TATC TGC GGGAAAAT TGAGAACATC AC T GCTC CAAAGA 
^ C CCAAGGGTTGCGCT GGCCGT GGC AGGC AGCCATC TACAGGAGGACCAGCGGGGT GCATGACGGCAGCC TACACAAGGGAGC 

jfj GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 

GTC AC CATGAT CAAGACAGC AGACC TGAAAGT TGT T TTGGGGAAATTC TACC GGGATGATGACCGGGATGAGAAGAC CATCC 
AGAGCCTACAGAT TT CT GC T ATCAT TC TGCATCCCAAC TAT GACCCCATCCT GCT TGATGC TGACATCGCC AT CCT GAAGCT 
^ CCTAGACAAGGC CCGTATC AGCACCCGAGTC CAGC C CAT C T GCC TCGC TGC CAGTCGGGATC TC AGC AC TT C C TT CC AGGAG 

TCCC ACATCAC TGTGGC TGGCTGGAATGTCC T GGCAGAC GT GAGGAGCCCTGGCT TCAAGAACGACAC ACT GC GC TC TGGGG 
* P TGGTCAGTGTGGTGGACTCGC TGC TGTGTGAGGAGCAGCAT GAGGACCATGGCATC CCAGT GAGTGTC ACTGATAACATGTT 

C TGTGCCAGCTGGGAAC CCAC TGCCCC T TCTGATAT CTGCACTGC AGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGCATCTCCTGAGC(^ACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATAAAACATGCAGCCACAGGCTCTCCA 
C TGCC TTCACCAAGGTGC TGC CTT T TAAAGAC TGGATTGAAAGAAATATGAA AT GAA CC ATGC TCATGCAC TC C TT GAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCC AGGGCT TCTGAC T TC AGGGACAAAACTCAGT GAAGGGTGAGTAGACC T CC ATTGCTGGTAGGC TGATGCC GCGTCCA 
C TAC TAGGACAGCCAAT TGGAAGATGCC AGGGC TT GCAAGAAGTAAGT T TC T T CAAAGAAGAC CATATACAAAACC TC TC CA 
C TC CAC TGACC TGGT GGTC T TCCCCAAC T TT C AGT TATAC GAATGCC ATCAGC TTGACC AGGGAAGATC TGGGCT T C ATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
T GTGTACATGGCCACAGTACAGTC TGGTCCT TTTC C TTCCCCATC TC T TGTACAC AT TT TAAT AAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ I LLE S YPLNAHCEWT I HAKPG FVI QLRFVMLS LE FDYMCQ YD YVE VRDGDNRDGQ I 1 
KRVCGNERPAP I QS I GS SLHVL FHS DGSKNFDGFHAI YEE I TAC S S S PC FHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPAL PFGDLPMGYQHLHTQLQYECI S P FYRRLGS SRRTCLRTGKWSGRAPS C I P I CGKIENI TAP 
KTQGLRWPWQAAI YRRT SGVHDGSLHKGAWFLVC SGALWERTVWAAHCVT DLGKVTMI KTADL 
KWLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR 
DLSTSFQESHITVAGWWIADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVS FPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites . 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 2Q 



G GTTCCTACAT C CT CT CAT CT GAGAAT CAGAGAGCATAATCT T CTTAC GGGCC C GT GATTTATTAAC GT GGCT TAAT C 
T GAAGGT T CTCAGT CAAAT TCTTT GT GAT CTAC TGATT GTGGGGGCAT GGCAAGGTT T GCT TAAAGGAGCTT GGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGA ATGA AGGCGCTTCTGTTGC 
T GGT CT T G C CTT GGCT CAGTC CTGCT AACTACAT TGACAAT GT GGGCAACCT GCACTT CCT GT ATT CAGAACTCTGT A 
AAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
T CACAGCCACGGCT CCCT CCC CAGAGGTTTCT G CAGCT GCCACCAT CT CCT TAATGACAGAC GAGCCT GGC CTAGAC A 
AC C CTGC CTACGT GT CCT CGGCAGAGGAC GGGCAGCCAGCAAT CAGC C CAGT G GACT CT GGC CGGAGCAAC CGAAC TA 
GGGCACGGC CCTTT GAGAGAT CCACTATTAGAAGCAGAT CATT TAAAAAAATAAATCGAGCT TTGAGT GTT CT TCGAA 
| 7S i GGACAAAGAGCGGGAGT GCAGT T GCCAAC CAT GC CGACCAGGGCAG GGAAAATT CT GAAAACACCACT GCC C CT GAAG 

f*\ TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAA 
fjj GCCT CT CTATTAGG C T GGT GGGAGGTAGCGAAAC CC CACTGGT CCATAT CAT TAT CCAACACAT TTAT C GT GAT GGGG 

If! T GAT C GCCAGAGAC GGCC GGCTACTGCCAGGAGACAT CATT CTAAAG GT CAAC GGGAT GGACAT CAGCAAT GT C CCT C 

||| ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
Ifl GCAGGAACAAT GGACAGGCCCC GGAT GCCTACAGACCCC GAGAT GACAGCTT T CAT GT GAT T CT CAACAAAAGTAGC C 

CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
vll GTGT GG CAT AT C GACAT G GTCAGCTT GAGGAGAATGACC GT GTGT TAGCCAT CAAT GGACAT GATCTT C GATAT GGCA 

3 GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
Q GGAGCCCT GACAT CTT T CAGGAAGCCGGCTGGAACAGCAAT GGCAGCT GGT C CCCAGGG CCAGGGGAGAGGAG CAACA 

|f| C T C CCAAGCCC C T CCAT CCTACAATTACTTGT CATGAGAAGGT GGTAAATATCCAAAAAGAC CC CGGT GAATCT CTCG 

Q GCATGACCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAGCCCGGAGGAG 
i|| T CATAAG CAGAGAT GGAAGAATAAAAACAGGT GACAT T TTGT T GAAT GT GGAT GGG GT CGAACTGACAGAGGTCAGC C 

Q GGAGTGAGGCAGT GGCAT TAT T GAAAAGAACAT CATC CT CGATAGTACT CAAAGCTT T GGAAGT CAAAGAGTAT GAGC 

f|J C CCAGGAAGACT GCAG CAGCCCAGCAGC CCT G GACT C CAAC CACAACAT GGCC CCAC CCAGT GACT GGT CCC CATC CT 

GGGTCAT GT GGCT GGAATTACCAC GGT GCTTGTATAACTGTAAAGATATTGTAT TAC GAAGAAACACAGCT GGAAGT C 
TGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACAAACCTTTTTTCATCAAATCCATTGTTGAAGGAA 
CACCAGCATACAAT GAT GGAAGAATTAGAT GT GGT GATATT CT TCTT GCTGT CAAT G GTAGAAGTACAT CAGGAAT GA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TT TTATAGAAT CAAT GAT GGGTCAGAGGAAAACAGAAAAATCACAAAT AGGCTAAGAAGTT GAAACACTATAT TTATC 
TTGT CAGTT TT TATAT TTAAAGAAAGAATACATT GTAAAAAT GT CAG GAAAAGTAT GAT CAT CTAATGAAAGC CAGTT 
ACACCT CAGAAAATAT GAT TCCAAAAAAATTAAAACTACTAGTT T TTT TT CAGT GTGGAGGAT T TCT CATTACT CTAC 
AAC^TTGTTTATAT TT TTT CTAT TCAATAAAAAGC CCTAAAACAACTAAAAT GATT GATTT GTATACCC CACT GAATT 
CAAGCT GATTTAAATT TAAAATT T GGTATAT GCT GAAGT CT GC CAAGG GTACAT TATGGCCATT TT TAAT TTACAGCT 
AAAATATT TTTTAAAATGCATTGCTGAGAAACGTT GCTTT CAT CAAACAAGAATAAATATT TTT CAGAAGTTAAA 
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FIGURE 40 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVS AAAT I S LMTDE PGLDNPAYVS S AEDGQPAI S PVDSGRSNRTRARP FERS T I RS RS FKKI NR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHII IQHIYRDGVIARDGRLLPGDI ILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDS FHVI LNKS S PEEQLGI KLVRKVDE PGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGTATCTTCAGTTGTCATCAA.GTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCT GCAGAGGCAGCAGCT GGATTATGGCAT CTACGTCAT CCACCAGGCT GAAGG 
TAAAAAGT TTAATCGAGCCAAACT CTT GAATGTGGGCTAT CTAGAAGCCCTCAAGGAAGAAAATT GGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATCCCAAGCAT CTGGTGGT TGGCAGGAACAGCACTGGGT ACAGGTTACGT TACAGTGGAT AT TT TGG 
G GGT GTTAC T G C C CT AAGC AGAGAG CAGT T T T T CAAGGT GAAT GGAT TC T CTAAC AACTAC T G GGGAT 
GGGGAGGC GAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
GATGAAGCTCT TACAC CAAGT GTCACGAGTCTGGAGAACAGATGGGT TGAGTAGTT GTTCTTATAAA.T 
TAGTATCT GTGGAACACAAT CCTT T ATAT ATCAACATCACAGTGGAT TT CTGGTT T GGTGCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAGCAGAGCTCCTGGT GATGTAGAGTATAAAACAGT TGTAACAAGACAGCTT T CTTAGT CATTTTGAT 
CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AAT GAAC GCTAT T T G AGGAC T CT GGT T GAAGG AGAT T TAT T TAAAT T T GAAGT AATATAT TAT GGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
G GTAGC AG GAGGGTG GAGT G T CGGCT GC AAAGG CAGCAGT AGCT GAG C T GG T T GCAG GT G CT GAT AGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TT T AAAGAGTTT TT GTAAAATGATTTTGTACAAGTAGGATATGAAT TAGCAGTT TACAAGTTTACAT 
ATTAACTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMMTFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
S VEHNPL Y INI TVDFW FGA 



H Important features: 

W Signal peptide: 

amino acids 1-27 

Iff N-glycosylation sites* 

Ul amino acids 4-8, 220-224, 335-339 

. 



Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 



GC TCAAGACCCAGCAG TGGGACAGCCAGACAGACGGCACG ATG GCACT GAGC TCCCAGATCT GGG 
CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 
ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 
GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 
ATCGATCAAAGTGTGGGATGTGCTGCAAGACG TAGA ACCTACCTGCCCTGCCCCCGTCCCCTCCC 
TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 
TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE dd 



MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CIFCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 



APP ID=10063594 



Page 196 



FIGURE 45 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC TTCAACACAACCCC T CT TGTCAC CATACAGCCAGAAGGGGGCACTAT CATAGT GACC C AAA. 
AT CGTAAT AGGGAGAGAGT AGACT T CCCAGAT GGAGGCTACTCCCT GAAGCT CAGCAAACT GAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAAT GGCACCT G T GT GACCAAT CTGACATGCTGCAT GGAACATGGGGAAGAGGAT GT GAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGAT GGGGAGAAAGT GATATGACCT T CATC TGCGT T GCCAGGAACCC T GTCAGCAGAAAC T 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GT TT C T GAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAG TGGACAT T TGT CGGG 
AAAC TCC TAACATATGCCCCCATTCT GGAGAGAACACAGAGTAC GACACAAT CCC TCACACTAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAA?VGAT 
GGAAAAT CCCCACTCACT GCT CACGATGCCAGACACACCAAGGC TAT T T GCCTATGAGAATG TTA 
TC TAGAC AGC AG TGCACT CCCC TAAGT C T C TGC TCA 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFS S P I LARKLCEGAADDPDS SMVXLCLLLVPLLLS L FVLGLFLWFLKRERQEE Y I E 
EKKRVDI CRETPNICPHSGENTEYDT I PHTNRT ILKEDPANTVYSTVE I PKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACC ATGA CCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGT C TGATGGCCAT TCCAGCAACAACAAT GT CCT T GACAGCAAGAAAAAGAGCGTG 
CTGGAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAAT GCCAAT TGT GAAT T T TCATT GAAAAACATCAGT GACAT T CAT CCAGAATCCT TCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCAT GGCGAGT GGCT GGAGAGCAT CTAGT T TCCACT T CGAT T CTGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TG TAG TT TAAT GGGAATAAAATGTAAGTAT CAGTAGT T T GAAAAAA A A AAA 
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FIGURE 48 

mtccegwtscngfsllvllllgwlnaiplivslveedqfsqnpiscfewwfpgiigaglmaipa 
t tms ltarkraccnnrtgmfls s ffs vi tvi gal ycml i s i qallkgplmcns psnsnance fs l 
knisdihpes™lqwffndscapptgfnkptsndtmasgwrassfhfdseenkhrlihfsvflgl 
ll vg i levl fglsq i vi g flgclcgvskrrs q i v 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II) , 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE AQ 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CAT TT GCCAATAAAGACGAT CCC TTC T AC TATGACTGGAAAAACC TGCAGC TGAGCGGACT GATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
y k CAGCCAGAAGCAGCACAGT CCT GTACC TGAGAAGGCCAT CC CACT CATCAC TCCAGGCT CTGCCA 

C3 CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
|=f TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 

f I TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 

III 

iff 
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FIGURE so 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAG IAAVLSGKCKYKS 
SQKQHS PVPEKAI PL I TPGS AT TC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 

GT GGAC T CT GAGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGCT GCAGACCCAGAGGGAGGG 
AGGACAGGGAGT CGGAAGGAGGAGGACAGAGGAGGGCACAGAGAC GCAGAGCAAGGGCGGCAAGG 
AGGAGACCC TGGT GGGAGGAAGACAC T CT GGAGAGAGAGGGGGC TGGGCAGAG ATGA AGT T CCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
T T GGCAGACAGGCAGAAGAT GTCAT T CGACACGGAGCAGAT GC TGTCC GCGGCT CC TGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGASATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGT CAATAC TGTGAACTC TGAGACG TC T CC TGGGAT GT T TAACT T T GACACTT T C T GGAA 
GAAT T TTAAAT CCAAGCT GGGT T TCAT CAAC TGGGAT GCCATAAACAAGGACCAGAGAAGC TCTC 
GCATC CCG TGACCT CCAGACAAGGAGC CACCAGAT T GGAT GGGAGCCCCCACACTCCC T CC T T AA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAM 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?W\^^ 
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FIGURE *2 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGD7VLSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDT FWKNFKSKLGFINWDAINKDQRS SRI P 

Signal peptide: 

amino acids 1-21 



N-glycosylation site. 

amino acids 265-269 

s Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

{*! Casein kinase II phosphorylation site. 

ih amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

ru 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGT CC TATATAACGAT C T T CAACAAGAGT GCAAACATCATGC T TGACAAGTGGCAGCACCTGG 
CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 
CAGAAATGCATCTTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT 
CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGT GCAAGAGC T T CTGAAGGACC GCGATCC TAAAGAGAT T GAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGT TT TTTT GCAGAT TGTCAT GAATAAAACGGTGC TGTCAAA 
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FIGURE *A 

MSLLSLPWLGLRPVAMSPWLLLLLWGSWLIARILAWT^ 

L I TPTEEGLKDS TQMSAT YSQGFTVWLGPI I P FI VLCHPDT IRS I TNASAAI APKDNLFIRFLKP 

WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 

SLMTLDSLQKC I FS FDSHCQERPSEYI AT I LELSALVEKRSQHI LQHMDFLYYLSHDGRRFHRAC 

RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 

MFGGHDTTASGLSWLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 

LHPPAPFISRCCTQDIVLPDGRVIPKGITGLIDIIGVHHNPTWPDPEVYDPFRFDPENSKGRSP 

lAFIPFSAGPRNCIGQAFAMAEMKWLALM^ 

VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 



ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGT TTTATGAAGC T TT GGAAGGCACTATGGACAGAAGCTGGTGGACAGTTTT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
, GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
m GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
fl TC TGGAT GT T G TCCCACT GAAT T CCCATGAATACAAAC CT AT TCAGCAACAGCAAAAAAAAAAAA 

111 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

:: 57 

uj 

3^5 



4i| 



I'll 
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FIGURE *6 



MGPVKQLKRMFE PTRL I AT IMVLLC FALTLCS AFWWHNKGLAL I FC I LQS LALT W YSLS FI PFAR 
DAVKKC FAVC LA 

Important features: 
Signal peptide: 

amino acids 1-33 

H 

pj Type II fibronectin collagen-binding domain protein. 

El amino acids 30-72 

§ 

m 
$ 

a 
is 
a 
w 

fy 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCA6GCCGCCAACATGCTCTGTCTGTGCCTG 
T AC GTG CCGGT CAT CGGGGAAGCCCAGACCGAGTT CCAGTACTTT GAGT C GAAGGGGCT CCCT GCCGAGCT GAAGT CC 
ATT TT CAAGCT CAGTGTCT T CAT C C CCT C CCAGGAATT CT C CAC CTAC CGC CAGT GGAAGCAGAAAAT T GTACAAGCT 
GGAGATAAG GACCT T GAT GGGCAG CTAGACTTT GAAGAATTT GT CCATTAT CT CCAAGAT CAT GAGAAGAAGCT GAGG 
CTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATGCAGTCCCTGCGGGACTTG 
GGAGTCAAGATATCTGAACAGCAGGCAGAAAAAATTCTCAAGAGCATGGATAAAAACGGCACGATGACCATCGACTGG 
AACGAGTGGAGAGACTACCACCTC CT CCACCCC GT G GAAAACAT CC CC GAGAT CAT CC T CTACT GGAAGC ATT C CACG 
ATCTTTGATGTGGGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCAT GCCT CCC GCAGCAACAACAT GGGCAT CGT T G GTGGCTTCACT CAGAT GAT T CGAGAAGGAGG GGC CAGG 
TCACTCTGGCGGGGCAATGGCATC^ACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAG 
ATCAAGCGC CT TGTTGGT AGT GAC CAGGAGACT CT GAGGAT TCACGAGAGGCTT GT GGCAG GGT CCTT GG CAGG GGCC 
%zk ATC GC CCAGAGCAGCAT CT ACC CAAT GGAGGT C CTGAAGAC CC GGAT GGCG CT GCGGAAGACAGGCCAGTACTCAGGA 

?** ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 

G GCAT CAT C C C CTAT GCCGGCATC GACCT T GCAGT CTACGAGACGCT CAAGAAT GCCT G GCTGCAGCACTATGCAGT G 
Q AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
m CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
!% TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
lif G CT GT GAG CAT CAGCTACGT GGT CTACGAGAACCT GAAGAT CACCCTGGGC GT GCAGT C GCGGTGACGGGGGGAGGGC 

HI C GC CC GGCAGT GGACT CGCT GAT C CT GG GCCG CAGC CT G GGGT GTGCAGCCAT CT CATT C T GT GAAT GT GCCAACACT 

AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
Uj GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
J* CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
**** TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
* CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
f"% ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
t% TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
If I CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
gl GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
3^ CCATGTTTGAGGGCGAAGGGC^GAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
III ACT GT T GGGAAAAGGGTTT T GT CCAGAAGGACAAGC CGGACAAATGAGCGACTT CT GT GCT TCCAGAG GAAGACGAGG 

f*| GAGCAGGAGCT TGGCT GAC T GCT CAGAGT CT GT T CT GAC GC CCT GGGGGTT CCTGT CCAACCCCAGCAGGGGCGCAGC 

Jf;; GGGACCAGCC CCACAT T C CACTT GT GT CACT GCTT GGAACCTAT TT AT TTT GTATTTATTT GAACAGAGTTAT GTC CT 

I II AACTATTTTTATAGATTTGTTTAATTAATAGCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCAT.GGTT 

GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTC CAGAGAAATTCCT TT T GGGACTGGAGGCAGAAAAGC GGCCAGAAGGCAGCAGCCCT GGCT CCT TT CCTTT GGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AAC CT T GAAGGTGGAAT C CAGTTATT T C CT G C GCT GCGAGGGTT T CTT TATT TCACTCTTT T CT GAAT GT CAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAG GAAAT CT TCAATAGGAT GCAAAGAT CAAT GCAAAAATT GT TATATAT GAACATATAACT GGAGT C GT CAAAAAG 
CAAAT TAAGAAAGAAT T GGACGTTAGAAGTT GT C ATTT AAAGCAGCCT T CTAATAAAGTT GTTT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAkAAAAAAAAAAAAAAAAAAA^ 
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FIGURE *8 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE I IL YWKHST I FDVGENLTVPDE FTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FblAYEQIKRLVGSDQETLRIHERLVAGSIAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LAREGVAAFYKG YVPNMLG 1 1 P YAG I DLAVYE TLKNAWLQHYAVNSADPGVFVLLACGTMS S TC 
GQLAS YPLALVRTRMQAQAS IEGAPEVTMS SLFKHILRTEGAFGLYRGLAPNFMKVIPAVS ISYV 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 



APP ID=10063594 



Page 210 



FIGURE so 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT TGGGGAGGATGGAATCCT GAGCTGCAC T T T T GAACC TGACATCAAAC T T TC TGATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAAT G T TCAGAGGCCGGACAGCAGT GT TT GCTGATCAAGT GATAGT T 
! sL . GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
□ CATCACT T CTAAAGGCAAGGGGAATGCTAACC T T GAGT ATAAAAC TGGAGCC T TCAGCAT GCCGG 

Q AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
fl CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 

2 : » 

i% CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 

ill 

~?% T CAACAACACATACTCCT GT AT GAT T GAAAATGACATT GCCAAAGCAACAGGGGAT ATCAAAGTG 

*p ACAGAATCGGAGATCAAAAGGCGGAGT CACCTACAGCTGC TAAACTCAAAGGCT TCTCTGT GTGT 

* CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 

____ 

ft GTGCCT T GGCCACAAAAAAGCATGCAAAG TCAT TGT TACAACAGGGAT CTACAGAACTAT T TCAC 

m CACCAGATATGAC C TAGTT T TAT AT T T CT GGGAGGAAAT GAAT T CATATC TAGAAG TCT GGAGTG 

|,-| AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGC T CCAAT AT GAACAAGATAAAT 

Q C TAT C T T CAAAGACATAT TAGAAGT TGGGAAAATAAT TCAT GTGAACTAGACAAGTGT GTTAAGA 

fM GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 

CAC C T G G G GAG T G AGAG G AC AG GAT AG T G CAT GTTCTTTGTCTCT GAAT T T T TAG T TAT AT G T G C 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAAACAGAGCAGT 
CGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YI I TSKGKGNANLEYKTGAFSMPEVWDYNAS SETLRCEAPRWFPQPTVVWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVSSFFAISWALLPLSPYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 

TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
y„ GGTGGACT CAGATCACAGT GGC TATATCT CCATGAAGGAGC TAAAGCAGGCCCT GGTCAAC T GCA 

Q ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 
W GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 

CCAGCAGTATGACCGGGACCGC T CGGGCT CCAT TAGCTACACAGAGC TGCAGCAAGCTC TGT CCC 

?2 AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 

111 

~H GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
.JZ GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
S CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 

TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
Ik* ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
yi AATAGT GAGGACCGGGGC TGAGGCCACACAGATAGGGGCCT GAT GGAGGAGAGGATAGAAGT TGA 

Q ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
111 AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 

GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
C TGAC CAAGCAT GAGAGAGATCT GTCTAT GGGACCAGTGGC TTGGAT TCTGCCACACCCAT AAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLMMIN^ 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6s 

CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
m TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
f\ TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
ff! CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
W TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
jj| ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATAT TAAT CGGTATCCAGGAGG 

'2 CAGCTGGGGASATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGC TGGGG GAAT AT T CAT CT AT ACC C AG G TAT C AAT AAC CC AT T T C C T C C TGGAG TTCTCCGC 
O CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
|ji TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
p| T GCT CATC TCCAATAAAATAAAAGCACT TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

fll AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^AAAA^ 
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FIGURE 64 

MQGRVAGSCAPLGLLLVCLHLPGLFARS I GWEEKVSQNFGTNLPQLGQPSS TGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-rayjri s toyl a ti on s i tes . 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 65 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACT ATG GGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 

r\ 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 



f^r N-glycosylation site. 

Q amino acids 37-41 

P 

^2; N-myristoylation sites. 

~~ amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
l7 t GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
f s % GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
if! GGCAGTGCCCTCTCT GGGCAGT CAGAT CCACCCAGTGCT TAATAGCAGGGAAGAAGGTACT T CAA 

W AGACT CTGCCCCT GAGG TCAAGAGAGGAT GGGGC TAT TCAC TT T TAT AT AT T TATATAAAAT TAG 

% jf T AGT GAGAT GTAAAAAAAAAAAAAAAAAA 

;;. ; H 



yl 



W 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLTOKLREKRQTEGT^ 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GCCAGGAATAACT AGAGAGGAACAATGGGGT T ATT CAGAGGTTTT GTT TT CCT CT TAGT T CTGT GC CT GCT GCAC CAG 
T CAAAT ACTT C CTTCATTAAGCT GAATAATAATGGCTT TGAAGATATTGT CAT T GTTATAGATCCTAGT GTGC CAGAA 
GAT GAAAAAATAAT T GAACAAATAGAGGATATGGT GACTACAGCTT CTACGT AC CTGTT T GAAGCCACAGAAAAAAGA 
TTTTT TT T CAAAAAT GT ATCTATATTAATT CCT GAGAATT GGAAGGAAAATCCT CAGTACAAAAGG CCAAAACAT GAA 
AACCATAAACATG CT GAT GTTATAGT TGCACCACCTACACT CCCAGGT AGAGAT GAAC CATACAC CAAG CAGTTCACA 
GAAT GT GGAGAGAAAGGCGAATACATT CACT TCACC CCT GACCTT CT ACT T GGAAAAAAACAAAAT GAATATG GACCA 
CCAGG CAAACT GTT T GTCCAT GAGT GGGCT CACCT C CGGT GGGGAGT GT TT GATGAGTACAATGAAGAT CAGC CTTTC 
T ACC GT GCTAAGT CAAAAAAAAT C GAAGCAACAAGGTGTT C CGCAGGTAT CT CT GGTAGAAATAGAGTTTATAAGT GT 
C^GGAGGC^GCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACTVAAACTGTATGGAAAAGATTGTCAATTCTTT 
C CT GATAAAGTACAAACAGAAAAAGCAT CCATAAT GTT TAT GCAAAGTAT T GATT CTGTT GTT GAATTTT GTAAC GAA 
AAAACCCATAATCAAGAAG CT CCAAGCCTACAAAACATAAAGT GCAAT TT T AGAAGTACAT GGGAGGT GATTAGCAAT 
T CT GAGGAT TT TAAAAACACCATACC CAT G GTGACACCACCTC CT CCACCT GT CTT CT CAT TGCT GAAGAT CAGT CAA 
AGAAT T GT GT G CTTAGTT CT T GAT AAGT CTGGAAGCAT GGGGGGT AAGGAC CGC CTAAAT CGAAT GAAT CAAGCAGCA 
Zll AAACATTT CCT GCT GCAGACT GTT GAAAAT GGAT CCTGGGT GGGGAT GGT T CACT TTGATAGTACT GCCACTATT GTA 

Q AAT AAGCT AAT CCAAATAAAAAGCAGTGAT GAAAGAAACACACTCAT GG CAGGATT ACCTACATAT CCT CT GGGAGGA 

f*| ACTTCCAT CT GCT CT GGAATTAAATATGCATTT CAGGT GATTGGAGAGCTACATT C CCAACT CGAT GGAT CCGAAGTA 

III CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
IP 5 T TTATT GCTTT GGGAAGAGCT GCT GATGAAGCAGTAAT AGAGATGAGCAAGATAACAGGAGGAAGT CATTT TTAT GTT 

|J. T CAGAT GAAGCTCAGAACAAT GGCCTCAT T GAT GCTTT T GGGGCT CTTACATCAGGAAATACT GAT CT CT CCCAGAAG 

si*. TCCCTTCAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAATTGATAGTACA 
I? I GTGGGAAAG GACACGTTCTT T CT CAT CACAT GGAACAGT CT GC CT CC CAGTAT TT CT CT CT GGGAT CC C AGTGGAACA 

%?l ATAAT GGAAAATTT CACAGT GGAT GCAACTT CCAAAAT G GCCTATCT CAGTAT T CCAGGAACT GCAAAGGT GGGCACT 

% TGGGCATACAATCTTCAAGCCAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCAGCAAATTCTTCT 
4* G T GC CT CCAAT CACAGT GAAT GCTAAAAT GAATAAGGACGT AAACAGTTTC CCCAGCCCAAT GATT GT TTACGCAGAA 

5 ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT 
^ T TGGAACT T TT GGATAAT GGT GCAGG CGCT GAT T CTTT CAAGAATGAT GGAGT CTACTCCAGGTATT TTACAGCAT AT 

W ACAGAAAAT GG CAGATATAGCTTAAAAGT TCGGGCT CAT GGAGGAGCAAACACT GC CAGGCTAAAATTACGGC CT C CA 

Ifl CT GAATAGAGC C GCGTACAT ACCAGGCT GGGT AGTGAAC GG GGAAATTGAAGCAAACC CGC CAAGAC CT GAAATTGAT 

74 GAG GATACT CAGACCACCT T GGAGGAT TT CAGC CGAACAGCAT CCGGAGGTGCATTT GTGGTAT CACAAGTCCCAAGC 

Isl CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT 
|.|| ACAT GGAC AGCACCAGGAGATAATTT T GATGTT GGAAAAGTTCAACGTTATAT CATAAGAAT AAGT GCAAGT ATTCT T 

GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCCAAGGAA 
**! AGCTTTGCATTTAAACCAGAAAATAT CT CA.GAAGAAAAT GCAAC CCACATATT TATT GCCATTAAAAGTATAGAT AAA 

HI AGCAAT TT GACAT CAAAAGTATC CAACAT TGCACAAGTAACT TT GT TTAT CC CT CAAGCAAAT CCT GAT GACAT T GAT 

C CT ACACCTACT C CTACT CCTACT CCT ACTCCT GATAAAAGTCATAAT T CT GGAGTTAATATTT CTAC GCT G GTATT G 
T CT GTGATT GGGT CT GTT GT AATT GTTAACTTT ATT TT AAGTACCACCATTTGAAC CTT AAC GAAGAAAAAAAT CTTC 
AAGTAGACCTAGAAGAGAGT T TTAAAAAACAAAACAAT GTAAGTAAAGGATAT T TCTGAAT CTTAAAAT TCAT CCCAT 
GTGTGATCATAAACTCATAAAAATAATTTTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATA 
T GTAAAAACTGT CAAGATT AAAAT TTAAT AGT T T CATTTATTT GTT AT TTTATTTGTAAGAAAT AGT GATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAATTAAAAT CAT CTATCTGAGTAGT CAAAAT ACAAGTAAAGGAGAGCAAATAAAC^AACATT T GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQA?^FLLQTVENGSWGMVHFDSTATIWKLIQIKSSDERNTimGLPTYPLGGTSICSGIK 
, t YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
f* % SHFWSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
jTj ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
CIS AANS S VPP I TVNAKMNKDVNS FPSPMIVYAE I LQGYVPVLGANVTAFI E SQNGHTEVLELLDNGA 

^ GADS FKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAY I PGWWNGE IEANPP 

5 ? 

:; 2 ?. 

^| RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
p FDVGKVQRY 1 1 RI SAS I LDLRDS FDDALQVNT TDLS PKEANSKE S FAFKPENI SEENATH I FIAI 

9 KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
O VNFILSTTI 

Hi ' 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 



Hi 
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FIGURE 71 



CT CCTTAG GT GGAAAC CCTGGGAGTAGAGTACT GACAGCAAAGACCGGGAAAGACCATAC GTCCC CGGGCAGGGGT GA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAGCCACCT CCTAACCTTC C CTTCTT GAACC CCCAGT TAT GCCAGGATTTACTAGAGAGT GTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGT GAT GGGAAT GAAGGT GAAAACTT GGAGAT TTCACTTCAGT CATT GCTT CT GCCTGCAAGATCAT CCTT TAAA 
AGTAGAGAAGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTG AATGA TGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACTACGT GAGCAGCCT GAAG CGGCAGATCGCACAGCT CAAGGAG GAGCT GCAGGAGAGGAGT GAGCAGCT CAGG 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CTGGCCT TC CT GCACT CGCAGGT GGACAAGGCAGAGGT GAAT GCT GGC GT CAAG C T GGCCACAGAGTAT G CAGCAGTG 
CCT TT C GATAGCTT TACT CTACAGAAGGT GTACCAGCT G GAGACT GGC CTT ACC C GCCACC CCGAGGAGAAGCCT GTG 
iel AGGAAGGACAAGC GGGAT GAGTT GGT GGAAGCCATT GAAT CAG CCT TGGAGACC CT GAACAAT CCT GCAGAGAACAGC 

CCCAAT CACC GTC CTTACACGGC CTCTGATT T CATAGAAGGGAT CTACCGAACAGAAAGGGACAAAGGGACATT GTAT 
GAGCT CAC CTT CAAAGGGGAC CACAAACAC GAATT CAAACGGCT C ATCTTATTT CGAC CAT TCAGCCC CAT CAT GAAA 
GTGAAAAATGAAAAGCTCAACATGGCC^C^CGCTTATC^ATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTTC 
C GGCAGTT CAT GC AGAATTT CAGGGAGAT GT GCAT TGAGCAGGAT G GGAGAGT C CATCT CACT GTT GTT TACTT T GGG 
AAAGAAGAAAT AAAT GAAGT CAAAGGAAT ACTT GAAAACACTT CCAAAG CT GCCAACTT CAGGAACTT TACCTT CATC 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
T TT TT CTGT GATGTGGACAT CTACTT CACAT CT GAATT C CT CAATACGT GTAGGCT GAATACACAGCCAGG GAAGAAG 
GTAT TTTAT CCAGTT CTTT T CAGT CAGT ACAAT CCT GGCATAAT ATACGGCCAC CATGATGCAGTCC CT CCCTTGGAA 
CAGCAGCT GGT CAT AAAGAAG GAAACTG GATTT T GGAGAGACTT T GGATT T GGGAT GACGT GT CAGTAT CGGT CAGAC 
TT CAT GAAT ATAGGT GGGTTT GAT C T GGACATCAAAGGCT GGGGC GGAGAGGAT GT GCACCTT TAT CGCAAGTATCTC 
CACAGCAACCT CATAGT GGTACGGAC GC CT GTGCGAGGACT CTT C CAC CT CTGGCATGAGAAGCGCT GCAT GGAC GAG 
CTGACC CC CGAGCAGTACAAGAT GT GCAT GCAGT CCAAGGC CAT GAAC GAGGCAT C CCACGGCCAGCT GGGCAT GCTG 
GTGTT CAGGCACGAGATAGAGGCT CACCT T C GCAAACAGAAACAGAAGACAAGTAGCAAAAAAAC ATGAA CT CCCAGA 
GAAGGATT GT GGGAGACACTT TTT CTTT C CTTTT GCAATTACT GAAAGT GGCTGCAACAGAGAAAAGACTTCCATAAA 
GGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGA 
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AATATT CAT GATTTAAGAGCAGTT TT GTAAAAAATTCATTAGCAT GAAAGGCAAG CAT ATTT CT CCT CATAT GAAT GA 
GCCT AT CAGCAGGGCT CTAGTTT CT AG GAAT GCTAAAATAT CAGAAGGCAGGAGAGGAGATAGGCT TATTAT GATACT 
AGT GAGTACATTAAGTAAAATAAAAT GGACCAGAAAAGAAAAGAAAC CAT AAATAT CGT GT CATAT TTT CC CCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGT GAGTT ATAGT CTG CTTAT TTAATT AC CACTT T GCAAGC CT TACAAGAGAGCACAAGTT G GC CTAC 
ATTTT TATATTTT TTAAGAAGATACTTT GAGAT GCATTATGAGAACTT TCAGTT CAAAGCAT CAAATT GAT G CCAT AT 
CCAAGGACAT GCCAAATGCTGATTCT GT CAGGCACT GAAT GTCAGGCATTGAGACATAGGGAAGGAAT GGTTTGTACT 
AATACAGAC GTACAGATACT TTC T CT GAAGAGTATTTT C GAAGAGGAGCAACT GAACACTGGAGGAAAAGAAAATGAC 
ACTT T CT GCTT TACAGAAAAGGAAACTCATT CAGACT GGTGATAT CGTGAT GTACCTAAAAGT CAGAAACCACATTTT 
CT CC TCAGAAGTAGGGACC GCTT T CTTACCT GTTTAAATAAAC CAAAGTATACC GT GT GAAC CAAA.CAAT CT CTTTT C 
AAAACAGGGT GCT CCT CCT GGCTT CT GGCTT CCATAAGAAGAAATGGAGAAAAATAT ATATATATATAT ATATATT GT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
T AACT GAAT TATTT TT TAAATTAAGCAGTT CTACTCAAT CAC CAAGAT G CT T CT GAAAATT GCATTTTATTACCATTT 
CAAACTATT TT TT AAAAATAAATACAGTTAACATAGAGTGGTTT CTTCATT CAT GT GAAAATTATTAGC CAGCACCAG 
ATGC^TGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAAC 
ATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
CCATGAAT GGAAGGT GGTATT GCACAGCTAATAAAATAT GATT T GT GGATATGAA 
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FIGURE 72 

MMl^RGLIAWISRVV^ 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AE VNAGVKLATE YAAVP FDS FT LQKVYQLETGL TRHPEEKPVRKDKRDELVEAI E S ALE TLNNPA 
ENS PNHRPYT AS DFI EG I YRTERDKGT LYELT FKGDHKHEFKRL I L FRPFS P IMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANTFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVWTPTOGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEI 
RKQKQKTSSKKT 

Important features: 
Signal peptide : 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Ainidation site. 

amino acids 377-381 
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FIGURE 72 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTGTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACT ATG TTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATGCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAAT GC CCAGTCGCCCAT CGAT AT TCAGACAGACAGTGTGACAT T T GACCCTGATT TGCC 

«» T GCT C TGCAGCCCCACGGATATGACCAGCC T GGCACCGAGCCT T T GGACCT GCACAACAAT GGCC 

4,1 

Q ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 

If! GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGAICAACAG 

^ T GAAGCCACAT T TGCAGAGC TCCACATTG T ACAT TATGAC TCT GAT T CCT AT GACAGC TTGAGT G 

%| AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 

% AATATAGCTTATGAACACAT TCTGAGTCACT TGCATGAAGTCAGGCATAAAGATCAGAAGACC TC 

* AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 

O GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 

AT TT CAAT GGAACAGC TGGAAAAGC T T CAGGGGACAT TGT TCT C CACAGAAGAGGAGCCCT C TAA 

?"! GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 

ill 

pj ; TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
flJ TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 

CCGAAAGAGT GTGGT C T T CACC T CAGCACAAGCCACGACT GAGGC ATAA AT T CCT T CT CAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGAT GTCC TCC T TCCCCT GGACAT C T C TTAGAGAGGAAT 
GGAC CCAGGCTGTCAT TCCAGGAAGAACTGCAGAGCC T TCAGCC T CT CCAAACAT GTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CT CC T T AGGATAAAGAGTTGCT GT TGAAGT T GTATAT TT T TGAT CAATATAT T TGGAAAT TAAAG 
TTTCTGACTTT 
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FIGURE 74 

MLFSALLLEVIWIIJUU3GGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELH I VHYDS DS YDS LS EAAERPQGLAVLG IL IEVGETKN IAYEH I LSHLHEVRHKDQKTS VPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLIJ\VYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 
amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7R 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
AT G GAC T C C AC C AG AG G G T C T T C AAG GAG T T AAAG T T AC T T AC AC T G T G C AG TAT T T CA T C AC AA 
ATT GGCCCACCAGAGGTGGCAC TGAC TACAGATGAGAAGTCCAT TT CT GT T GTCC TGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCT TCCTGTT TCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
T CT G T GAT GGGC TAT TCCAT CT ACCGAT AT ATCCACGT TGGCAAAGAGAAACACCCAGCAAATT T 
GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
ACT T TATCACCC T CAATATCTCGGAT GAT TC TAAAAT T TC TCATCAGGATATGAGT T TACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGT GAAACAT T TAGGGTATGC T TCGCAT T TGAT GGAAAT T TT T TG TGAC T C T GAAG 
AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
ACAG T CAT T GAATAT GAATATGAT GT CAGAACCACTGACAT TT GT GCGGGGCC T GAAGAGCAGGA 
GCTCAGTTT GCAGGAGGAGGTGTCCACACAAGGAACATTAT TGGAGT CGCAGGCAGCGT TGGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAAT GAAACC TATC TCAT GCAATT CAT GGAGGAATGGGGGT TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
T CACCC CT T T GATCCCAGCCAT AAAGTACCT GGGATGAAAGAAGT T T T T TCCAGT T TGT CAG T GT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYWSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-2 8 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



If! 



GAGGAGCGGGCC.GAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAA.G ATGG CCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 

ft* 

"r% TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
Q CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
0* TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 

CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAAT GAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGAT CCTAAC TCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TT TACACCAAAGGTGACCAACTTATACTCAACT TGAATAACATCAGCT CTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGG I PVLGSLVNTVLKH 1 1 WLKVI TANI LQLQVKPSANDQELLVKI PLDMVAGFNT PLVKT I VE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVI EAS FNGMYADLLQLVKVP I SLS IDRLE FDLLYPAI KGDT I QLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDVVK^ 

EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNNI S S DR I QLMNS GI GWFQPDVLKN I 1 TE I IHS I LLPNQNGKLRS GVPVSLVKALG 
FEAAE S S LTKDAL VL T PAS LWKP S S P VS Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites . 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

I»BP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGTCAGCCTGGCAGAGAGACT CT GAAATGAGGGATTAGAGGT GT TCAAGGAGCAAGAGC 
T T CAGCCT GAAGACAAGGGAGCAGTCCCTGAAGACGC TTC TAC T GAGAGGT CTGCCATGGCC TCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCC TC T GGATGGAATGT GCCACACACAGCACAGGCAT CACCCAGTGTGACAT CTATAGC 
ACCCTTCTGGGCCTGCCGGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
J! GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
In ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
m CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
iy CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 

U 8 CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAXjTCAAGAGTGA 

n; 

l:f GTTCAATTCCTACAGCCTGACAGGGTATGTG TGAA GAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
s G GT C T G T GAAAAAC AG T GGACAG C AC C C C GAGG GCCACAGG T GAGGGACAC TAC C AC T GGAT C GT 

□ G TCAGAAGGTGC TGC T GAGGAT AGAC T GAC T TT GGCCATTGGAT TGAGCAAAGGCAGAAATGGGG 

111 GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
J-* TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 

GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
pi TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 

GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAAT T T GT GCATGAAC TGAAATAAAACCAT CCTACGGTAT CCAGGGAACAGAAAGCAGGAT G 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 
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FIGURE 80 

MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YSTLLGLPAD I QAAQAMMVTSSAI S SLACI I S WGMRCT VFCQESRAKDRVAVAGGVFFI LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRS S PRPGQPPKVKSE FNS YS LTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 81 



■- y 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCG.CCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
AC TGCCAGACTCC TGCTGCCAAGCCT GCAAAGAT GAGGCAAGT GAGCAATCGGAT GAAGAGGACA 
GT GT GCAGTCGC TCCATGGGGT GAGACATCC TCAGGAT CCATGT T CCAGTGATGC TGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACT TGGTGGAGATC TACCT CTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATC T T CCACTT GAC TCAGATCAAGAAAGT CAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACAT AACAAAGACCTAACAGT TGCAGATAT GAGC TGTATAATTGT TGT T AT TATATAT TAATAAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 82 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGTTCTCT AGCGTGGT CC ACCT CT AC AGGGAC GGGAAGGACC AG CCAT T TAT GC AGAT GC CACAGT AT C 
AAGGCAGG AC AAAACT GGT G AAGGATTC TATTGCGG AGGGGCGC AT CT CT CT GAGGCT GGAAAACATT ACT 
i c ;. GTGT TGGAT GCT GGCCTCT ATGGGT GC AGG AT T AGT T CC C AGTC TT ACT ACC AGAAGGCC AT CT GGGAGCT 

Q ACAGGT GT C AGC ACTGGGCTCAGT T CCT CT CAT T TCCAT C ACGGGAT ATGTT GAT AGAGACAT CCAGCT AC 

O TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
ill ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
W CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 

GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
T TTT TT GGCATT GT T GGACTGAAG ATT T TC TTCTCC AAAT TCCAGT GGAAAATCC AGGCGGAAC TGGACT G 
GAGAAGAAAGCACGGACAGGCAGAATT GAGAGAC GC CCGG AAAC ACGC AGTGGAGGT GACTCT GGATC CAG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CC TCACTCTGAG AAGAGAT TTAC AAGGAAGAGT GT GGTGGCT TCTC AG AG TT T C CAAGCAGGGAAACATT A 
CTGGG AGGTGGACGGAGGAC ACAAT AAAAG GTGGCGCGTGGG AGTGTGC CGGGAT G AT GTGG AC AG GAGG A 
AGGAGT ACGT GAC T TT GTCTCCCGATC ATGGGT ACT GGGT CCTC AG AC TGAATGGAGAAC ATTT GT AT TT C 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATT GAGGCC CT AC AT TG AGT ATC CGTCCT AT AATGAGCAAAATGG AACT C CCAT AGTC ATC TGC 
C C AGTCACCC AGGAAT CAGAGAAAGAGG CC TCTT GGC AAAGGGCCT CT GC AATC CCAGAGAC AAGCAACAG 
T GAGTCCTCCTC ACAGGC AACCACGCCC TTCCTCCC C AGGGGT GAAAT G TAGG ATGAATC AC AT CCCAC AT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACT GGCCT GTCCAC ATG GG AGTC AGGTGT CATGGCT G CC CT GAGC T GGGAGGGAAGAAGG 
CT GACATT AC AT TTAGTT T GCTCT CACT CCATCT GGCT AAGT GATCT TGAAAT ACC ACCT CT C AGGTGAAG 
AACCGT CAGG AATTCCCAT CTCAC AGGCTGTGGT GT AGATTAAGTAGACAAGGAAT GT GAAT AATGCT TAG 
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 
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FIGURE Sd 

MMiMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH 
L YRDGKDQP FMQMPQYQGRTKLVKDS IAEGRI S LRLENI T VT.DAGLYGCRI S SQS YYQKAI WELQ 
VSALGSVPLI S I TGYVDRDI QLLCQS SGW FPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVE I SL 
T VQENAGS ISC SMRHAHL SRE VE SRVQ I GDT FFE P I S WHLATKVLG I LCCGLFFG I VGLKI FFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLS PDHGYWVLRLNGEHL YFT 
LNPRFI SVFPRTPPTKIGVFLDYECGT I S FFNINDQSL I YTLTCRFEGLLRPY IEYPS YNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8* 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACT TCT T T CG TATGGAGAAAGGAAG T ATAAAAT GGAAT TATAAACAT CACCGGCTCTCTGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
Q AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
fl GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
?!f GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

«! 

. n ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
J* GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
M GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 

GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
Q CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
fll TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAAC TGCAGAGAC TCACCC TGATT GAGGGATCACAGCCCC TCCAGGCAAGGGAGAAGTCA 
GAGGCTGATTCTTGTAGAATTAACAGCCCTCAACGTGATGAGCTATGATAACACTATGAATTATG 
T GCAGAGT GAAAAGCACACAGGC T T TAGAGT CAAAGTATC T CAAACC T GAAT CCACACTGTGCCC 
TCCCTTTTATTTTTT T AAC T AAAAG AC AG AC AAAT T C C T A 



o 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
C C AAG G AAAG T G C AGC T GAG AC T C AG AG AAG AT T AC A AT GA AC C AAC T C AG CTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACT TACT TCAAGGAATGGACC TGT 
TCTTCGTC TCCAT C T C T GCCCAGAAGCT GCAAGGAAAT CAAAGACGAAT GT CC T AGTGCAT T TGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
M» CAACTACAACACCTT TGGATCTGCAGAGGCGGCCACGAGCGAT GACTACAAGAACCCT GGCTACT 

y ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 

AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
jij GT T T GGCAT CTACCAGAAATAT CCAGTGAAATAT GGAGAAGGAAAGT GT T GGACT GACAACGGCC 

Iff CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
%U G G C CAGCGG GAAT T C AC T GCGGGAT T T GT T C AG T T CAGGG T AT T T AAT AAC GAGAGAGCAGCC AA 

$.7. 

^ CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

««. GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 

HI GGAACT CAT G T TGGT TACAGCAGCAGCCGT GAGATAAC TGAGGCAGCT GTGCT TC TAT TC TATCG 

Q T TGA G AG TTTTGTGG GAGGG AAC C C AG AC CTCTCCTCC C AAC CAT GAG AT C C C AAG GAT G GAG AA 

■tti . CAACTTACCCAGTAGCTAGAATGT TAATGGCAGAAGAGAAAACAATAAATCATAT TGAC TCAAGA 

AAAAAA 

111 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
S DDYKNPG Y YD I QAKDLG I WHVPNKS PMQHWRNS S LLRYRT DT G FLQTLGHNL FG I YQKYP VKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCC ATG GGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
T CACT GAAATGGC T T TAT T C GT CACCGTC T T TGGGC TGAAAAAGAAAC CCT T CTGAT TACC TTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T T AAGACT T ATAT ACAGT T T TAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLWVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLI S I GAHCSASVALS FFI FERWECTTYWYI FVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE qi 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
T C TACATCGCGT TC TACAGCACGGAC TAT CGGC TGGT CCAGAAGGTGTGCCCAGAT TACAAC TAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCAT CC T T AAGC TAAGACAGGAC GAT T GT GG TCCT C CCAC AC TAAGGCCACAGCCCAT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWG I LGQP PNRPNHS PPPS AKVKK I FGWGDF YSNI KT VALNLLVTGKI VDHGNG T F 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 



Ml 



Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCGAAGAA 
ATGT T CCGAT T TGCATATTATAAACT C T TAAAAAAAGCCAGTGAAGG T T T GAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
[f GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
S ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
Iff TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
III TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
Ml AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
% AACC T CAGGGAACCAGCAC T TCCCAAACCGCAGACTACATC T T T AGAGGAAGCACAAC T GT GCCT 

2 T T T TCT GAAAATCCCT T TT T CT GG TGGAAT T GAGAAAGAAATAAAAC TAT GCAGATA 
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FIGURE Q4 

MTAAVFFGCAF I AFGPALAL YVFT IAIEPLRIIFL I AGAFFWLVS LLISS LVWFMARVI I DNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVIMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE Q5 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGTAGCAATCTGAAAC ATGA GGAGTACGAT TC TACTGTT TTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTT TCCT TCTT TAAGTCTGATACCATTAACA 
CAGAT GCT CACAC TGGGGCCAGAT CT GCAT CTG T TAAATCC TGCT GCAGGAATGACACC TGG TAC 
CCAGACCCACCCATT GACCC TGGGAGGGT T GAATGTACAACAGCAAC TGCACCCACATG TGT TAC 
CAAT T T T T G TCACACAAC T TGGAGC CCAGGGCAC TAT CC TAAGCT CAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
G TAA GCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT TGAT TATATTATGGAATAGATTGAGACACAT TGGATAGTCT TAGAAGAAATTAATTCTTAATT 
TACCTGAAAATAT TC T TGAAAT TTCAGAAAATAT GT T CTAT GTAGAGAATC CCAACT T T TAAAAA 
CAAT AAT T CAAT GGATAAATCT GT CT T TGAAATATAACAT TAT GCT GCCTGGATGATAT GCATAT 
TAAAACATATTTGGAT^CTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LP.QIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 



Signal peptide: 

amino acids 1-16 

u 
O 
O 

m 
w 

o 

If! 

o 
uj 
a 
in 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGC TACAGGT CACCCT GGAGAT GCAGGATGGACAT GT CT TGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
H CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
Q ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTAGAGGTCAAGGACATGGGTGA 
Q CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
S CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
ft CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 

TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
in AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
J| AT GGAT GAGAAT GACAACGT GCC TATCTGCCCTCCCCGTGACCCCACAGT CAGCAT CCCT GAGCT 

X: CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 

ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
* w CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
O CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
!|| TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
f*t ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
TTg CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 

GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
O GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
fO AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 

AGCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATT TT CACCCACT GGAC CATGT CAAGGAAGAAGGACCC GGAT CAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTC TGAA TGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
T CTGC CCT GGGGT GGAGGCAC CAT CACCAT CACCAGGCATGT CT GCAGAGCC TGGACACCAAC T T 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS I IESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWS GGDVHYHLES HPPGP FEVNAE GNL YVTRELDREAQAE YLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWWVQSVAKLVGPGPGPGATATVT VLVERVMP PPKLDQES YEASVP I SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
RE HI I P WVSHNAQMWQLLVRVI VCRCNVE GQCMRKVGRMKGMPTKLS AVG I LVGTLVAI G I FL I 
LI FTHWTMSRKKDPDQPADS VPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 



APPJD=1 0063594 



Page 250 



FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAA.GGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGATT CT TC CCAGGAACACAAACGTAGGAGACCCACGC T CC TGGAAGCACCAGCC T TTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCTTGCT T TCCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
T TCTCCTTATGTT TGGTCTACTAT TGCATT TAGAAGCTGCAACAAAT TCCAATGAGAC TAGCACC 
T CTGCCAACAC T GGATCCAGTGT GAT CTCCAGT GGAGCCAGCACAGC CACCAA.C TC TGGGT CCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGT CACCAACTC TGAGTT CCATACAACC TCCAGTGGGAT CAGCACAGCCACCAAC TC TGAG 
T T CAGCACAGCGTCCAGTGGGAT CAGCATAGCCACCAACTC TGAGTCCAGCACAACCT CCAGT GG 
GGCCAGCACAGCCAC CAACT CT GAGT CCAGCACACCC TCCAGT GGGGCCAGCACAGTCACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
H AGTAGGGCCAGCACT GCCACCAAC TC T GAGT C TAGCACAC TCT CCAGT GGGGCCAGCACAGCCAC 

Q CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
f*| CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
|X GCCACCAAC TC TGAGTCCAGCACAAC CT CCAGT GGGGCCAGCACAGCCAC CAAC TC TGAGT CCAG 

?;! AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
W GCACAGCCACCAACT C TGAC TCCAGCACAGT GTCCAGTGGGGCCAGCACT GC CACCAAC TC TGAG 

III TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
*jj GGCTAGCACAGCCAC CAAC T CT GACT CCAGCACAA.CC TCCAGTGGGGC CGGCACAGCCACCAACT 

j C TGAGTCCAGCACAGTGTCCAGT GGGATCAGCACAGT CACCAAT T CT GAGT C CAGCACACCC T CC 

AGTGGGGCCAACACAGCCACCAAC TC T GAG T CCAG TACGACCT CCAGT GGGGCCAACACAGCCAC 
CAACT CT GAGT CCAGCACAG TGT C CAGTGGGGCCAGCACTGCCAC CAAC TCT GAGT CCAGCACAA 
■C3 CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
I fl GCCACCAAC TC T GAC TCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAAC TC TGAG TC TAG 

CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAACTC TGGGTCCAGT GTGACCT C TGCAGGC TC T GGAACAGCAGCT CT GAC TGGA 
W ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
O GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
Iff TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 

CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGTCC TAAC TGGT TC TGGAGGAGACCAGTAT CATCGATAGCCAT GGAGAT GAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCT T GAAGAAGGTAT T CC TCACCTT TC T T GCCT T TACCAGACACT GGAAAGAGAATAC TATAT 
T GCT CAT T TAGC TAAGAAATAAATACATC T CATCTAACACACACGACAAAGAGAAGCTGTGC T T G 
C C CCGGGG TGGGTATC TAGCTCT GAGATGAACT CAGT TATAGGAGAAAACC TCCATGC T GGAC T C 
CATCT GGCAT TCAAAATC T CCACAGTAA?UIT CCAAAGACC TCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNS E S S T T S S GAS T ATNSE S S TVS S RAS TATNS E S S T T S S GAS T ATNSE S RT T SNGAGT ATNS E S 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



f! I 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGT T TCTAGCTACTGCACGTACT TCATAAAGCAGGACTC TAAAAGC T T TGGAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTGACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT T AT AAC T GG GGAT GCAT TTGTTCCTG GAGAAAGAAGT G T CAT TAT CAT GAAC CAT C GGACAA 
GAATGGAC T GGATGT TCC T GTGGAAT T GCC TGATGC GATATAGC T ACCT CAGAT T GGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CTTCATTCATAGGAAATGGAAGGAT GACAAGAGCCATT TCGAAGACATGAT TGAT TACT TT TGTG 
AT AT T C AC GAAC C AC T T C AAC T C C T CAT AT TCC C AGAAG G G AC T GAT C T C AC AGAAAAC AG C AAG 
T CTCGAAGTAATGCAT T TGC T GAAAAAAATGGAC T T CAGAAATAT GAATAT GT T T TACATC CAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAAT CCAC T T TCACG T CCACCGGT ATC CAATAGACACCCT CCCCACAT CCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAAT TT TTATT T TACC GGACAGAGTGTCAT T CCACCT T GCAAG T C TGAACT CAGGGT C 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATAT TTG TACAGTCT TGT TAAGT GGTAT T T TATAATCACCAT T GTAAT CT T T GTGCT GCAAGAGA 
GAATAT T T GGTGGAC TGGAGAT CAT AGAAC T TGCATGT TACCGACT T T TACACAAACAGCCACAT 
T TAAAT T CAAAGAAAAATGAGTAAGAT TATAAGGT T T GCCATGT GAAAACC TAGAGCATAT T T TG 
GAAAT GT T C T AAACC T T TC TAAG CT CAGATGCAT T T T TGCATGACTAT GT C GAATAT T T CT TACT 
GCCAT CAT TAT T TGT TAAAGATATT T T GCAC TTAAT T T TGT GGGAAAAATAT TGC TACAAT T T T T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATATTAT TAAACAATCAT CAGGCTT TTAAA 
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FIGURE 102 

MHSRGRE I WLLNPWS INEAVSS YCTYFIKQDSKS FGIMVSWKG I YFI LTLFWGS FFGS I FMLS P 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAuYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLI FPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFT FWDRLREGKNLDAVHDI TVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRS FYQGEKNF 
YFTGQSVI PPCKSELRVLWKLLS I LYWTLFSPAMCLL I YLYSLVKWYFI I T I VI FVLQERI FGG 
LE 1 1 E LAC YRLLHKQPHLNS KKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE los 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
T CCAAATCATCCATCCACCCCT GCTGTCAT CTG T T T T CATAGT GT GAGAT CAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCT CTACAGAGATGGGGAAGACT GGGAAT CTAAGCAGAT GCCACAGTAT CGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
J*!. ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
Q GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
Q TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
|I| AAAT C CAAAGGGAAAAT C CAGGC GGAACT GGAC T GGAGAAGAAAG C AC G GACAGGCAGAAT T GAG 

l'J % AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 

TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
l £\ TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
tfj CGT GGGACAAAATGTAGGGTGGT AT GT GGGAGT GT GT CGGGAT GACG TAGACAGGGGGAAGAACA 

,g ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 

ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
JL GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
^ T GACATGTCAGTT TGAAGGCTTGT TGAGACCCTATATCCAGCAT GCGATGTATGACGAGGAAAAG 

Iff GGGAC TCC CATAT TCATAT GTCCAGT GTCC T GGGGATGAGACAGAGAAGACCCTGC TTAAAGGGC 

£□ CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
|7j CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
fi TGAGC CCT GCAGCAGCGGCAGT CACAGCT T CCAGAT GAGGGGGGAT TGGCC T GACCCTGTGGGAG 

W TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
f|| CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 

ACAGT GAT TCCTGCCTCACAGGTGAAGATTAAAGAGACAACGAATGTGAATCATGCTTGCAGGTT 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
T G C T TAT T C C AC AT T AAT T T AC T T T T C T C TAT AC CAAAT C AC C CAT G G AAT AG T TAT T GAAC AC C 
TGCT T T GT GAGGC TCAAAGAATAAAGAGGAGGTAGGAT TT T T CAC TGAT T CT ATAAGC CCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C T CAT T AAC AC AG AC AC AAAAAT T C T AAAT AAAAT T T T AAC AAAT T AAAC T AAAC AAT AT AT T T A 
AAGAT GATATATAACTAC T CAG T GTGGT T T GTCCCACAAATGCAGAGT TGGT TTAATATT TAAAT 
ATCAACCAGTGTAAT T CAGCACAT TAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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FIGURE 1QA 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAWH 
L YRDGE DWE SKQMPQYRGRTE FVKDS I AGGRVS LRLKNI T PSDI GL YGCW FS SQI YDEEAT WELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESECVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARECHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE los 

CCTT CACAGGACTCT T CAT T GCTGGTTG GC AATGA T GT AT CGGCCAGAT GTGGT GAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 

ct c act gt t cat t at gtgagat at aatcaaaagaagacct ac aatt ac t at agc ac attgtc at ttacaac 
tgac aaact atatgct gagttt ggcagagaggct tc taac aatt tt ac ag aaat gagccagagact tgaat 
caat ggtg aaaaat gc at tt tat aaatctc c att aagggaag aatt tg tcaagt ct caggtt at caagttc 
agtcaacagaagcatggagtgttggctcatatgctgttgatttgtagatttcactctactgaggatcctga 
aact gt ag at aaaat t gttc aact tgtt tt ac at gaaaagct gc aagat gct gt aggacc ccct aaagt ag 
atcctc act c agtt aaaat t aaaaaaat caac aagacagaaacagacagct at c t aaacc at t gct gc gga 
ac acgaagaagt aaaact ct aggt c agagt ctc agg atcgtt ggtgggac ag aagt agaagagggt gaatg 
::: gccctggcaggctagcctgcagtgggatgggagtcatcgctgtggagcaaccttaattaatgccacatggc 
ttgtgagtgctgctcactgttttacaacatataagaaccctgccagatggactgcttcctttggagtaaca 
1*5 1 ataaaaccttcgaaaatgaaacggggtctccggagaataattgtccatgaaaaatacaaacacccatcaca 
IP tgactatgatatttctcttgcagagctttctagccctgttccctacacaaatgcagtacatagagtttgtc 

ttt. 'i 

u| tccctgatgcatcctatgagtttcaaccaggtgatgtgatgtttgtgacaggatttggagcactgaaaaat 
JS; gatggttacagtcaaaatcatcttcgacaagcacaggtgactctcatagacgctacaacttgcaatgaacc 
* tcaagcttacaatgacgccataactcctagaatgttatgtgctggctccttagaaggaaaaacagatgcat 
O gccagggtgactctggaggaccactggttagttcagatgctagagatatctggtaccttgctggaatagtg 

agctggggagatgaatgtgcgaaacccaacaagcctggtgtttatactagagttacggccttgcgggactg 
T"i gattacttcaaaaactggtatc taa gagacaaaagcctcatggaacagataacatttttttttgttttttg 

ggtgtggaggccatttttagagatacagaattggagaagacttgcaaaacagctagatttgactgatctca 
|?i ataaactgtttgcttgatgcatgtattttcttcccagctctgttccgcacgtaagcatcctgcttctgcca 

gatcaactctgtcatctgtgagcaatagttgaaactttatgtacatagagaaatagataatacaatattac 
attacagcctgtattcatttgttctctagaagttttgtcagaattttgacttgttgacataaatttgtaat 
gcatatatacaatttgaagcactccttttcttcagttcctcagctcctctcatttcagcaaatatccattt 
tcaaggtgcagaacaaggagtgaaagaaaatataagaagaaaaaaatcccctacattttattggcacagaa 
aagtattaggtgtttttcttagtggaatattagaaatgatcatattcattatgaaaggtcaagcaaagaca 
gcagaataccaatcacttcatcatttaggaagtatgggaactaagttaaggaagtccagaaagaagccaag 
atatatccttattttcatttccaaacaactactatgataaatgtgaagaagattctgtttttttgtgacct 
ataataattatacaaacttcatgcaatgtacttgttctaagcaaattaaagcaaatatttatttaacattg 
ttactgaggatgtcaacatataacaataaaatataaatcaccca 



APP ID=1 0063594 



Page 257 of 320 



FIGURE 106 



MMYRPDVVRARKRVCWEPWIGLVIFISLIVLAVCIGLTVHYVRYNQKKTYNYYSTLSFTTDKLY 
AEFGREASNNFTEMSQRLESMVKNA 

PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRI I VHEKYKHPSHDYDI SLAELS SPVPYTNAVHRVCLPDAS YE FQPGDVMFVTG FGALKNDG YS 
QNHLRQAQVTL I DATTCNE PQAYNDAI TPRMLCAGS LEGKTDACQGDSGGPLVS S DARD I WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 1Q7 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT GT T C TCAC T T TAACTGCAGAAACGAGGGTAGAGGAAGCAG TCAT T T T GACT TAC T T TCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GT GTAGAAC T GGC TT GT GGCGT T T GGACATATGAACAGGAAC TTAT GG TTCCAGTACAATGGT CA 
GATAT GGT CAC T T TGAAAGCCAGGAT GACAAAT TATGGAT TACCTAGATATCGGTGGC T TAC T CA 
y„ TGCT TGGAATT T T TT TCAGAGAGAGT T TAAGTGC TGT GGAGTAGTATAT T TCACTGAC TGGTTGG 

%l AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
^ CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
13 TT TGAGAGGAACCAAACAAC TGCAGGT GCTGAGG T T T C TGGGAATC TC CAT TGGGG TGACACAAA 

0! TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
yj ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 

GT TGAAACCAAGCCT GTCAAGAAT CT T TGAACACACAT CCATGGCAAACAGCT T TAATACACACT 
TTGAGATGGAGGAGT TATAAAA^lGAAATGTCACAGAAGAAAACCACAAACTTGTTT TAT TGGACT 
Sit TGTGAATTTTTGAGTACATACTATGTGTTTCAGAAATATGTAGAAATA7\AAATGTTGCCATAAAA 
*K TAACACCTAAGCATATACTAT TC TATGCT T TAAAATGAGGAT GGAAAAGT T T CATG TCATAAGTC 

s ACCACCTGGACAATAAT TGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCAC TGTGTAGCC 

#*§ TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
H TCCATGCAAACGAGTCACATATGGTGGGACTGGAGCCATAGTAAAGGT TGAT TTACTTCTACCAA 

111 CTAGTATATAAAG TAC TAAT TAAAT GC TAAC AT AGGAAGT TAGAAAAT AC TAATAACT T T TAT TA 

Q . CTCAGCGATCTATTCTTCTGATGCTAAAT7\AATTATATATCAGAAAACTTTCAATATTGGTGACT 
j il ACCTAAATGTGAT TT TTGCTGGT TACTAAAATATTCT TACCACTTAAAAGAGCAAGCTAACACAT 

<5 TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
If; T TCAGT TC TGAT AATGT TAAGAATAAC CAT TAT GAAAAGGAAAAT TT G T CCT GTATAGCATCAT T 

^ ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 

T GT TAT TTAAATAC T TAAC CAC TAAT T T T GAAAAT TACCAGTGT GAT ACATAGGAATCAT TAT TC 
AGAAT GTAGT CTGGTC T TTAGGAAGT AT TAATAAGAAAAT T TGCACATAACT TAGT TGAT TCAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCT TATCT T TGCCT TC TCCAAACAAGAAGCAATAGTC TCCAAGT CAATATAAAT TC TACAGAAAA 
TAGTGT T C T T T T T C TC CAGAAAAATGCT TGTGAGAATCAT TAAAACATGTGACAAT T TAGAGATT 
CTTTGTTT TAT T T CAC T GAT TAAT AT AC TGT GG C AAAT TAC AC AG AT TAT TAAAT T T T T T T ACAA 
GAGTATAGT ATAT T TATT TGAAAT GGGAAAAGTGCAT T T TACTGTAT TT TGTGTAT TT TGT T TAT 
T TCTCAGAATATGGAAAGAAAATTAAAATGTGTCAATAAATATT T TCTAGAGAGTAA 
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FIGURE 108 



MAREDSWCLRCLLYALNLLFWLMSISV 

VMIAVCCFLIIVGMLGYCGTVKRNLLLIAWYFGSLLVIFCVEIACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPS VELLKPSLSRI FEHTSMANS FNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains : 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE loo 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
C CT C TT C AT C AC AG GCAC C T CC GT GT C AC C C GT GGCCCT AGATC CTT GT T CT GC T T ACAT C AGC CT GAAT GAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGTGAATGGGGAG 
T GGT AC C ACT T C AC GGGC AT GGC GGGAGAT GCCAT GC CT AC CTT CT GC AT AC C AGAAAAC CACT GT GGAAC C C A 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
C AGC T GC T CAGATAC C AGC GAGT GCAC AT GC GCT CC AGG AAC T GT GC T AGGC C CT G AC AGGC AGAC AT GCT T T G 
C| AT GAAAAT GAAT GT GAGC AAAAC AAC GGT GGCT GC AGT GAGATCT GT GT GAACCTCAAAAACTCCTACCGCT GT 

CP GAGT GTGGGGTT GGC CGTGT GCT AAGAAGTGATGGCAAGACTT GT GAAGAC GT TGAAGGAT GCC AC AAT AAC AA 

III TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
W CTGAGGATAACCACACTTGCCAAGTCCCTGT GT T GT GC AAAT C AAAT GCCAT T GAAGT GAAC AT CC C C AGGG AG 

CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCT CAAGACAT GT GGT AC AGT G GTC G AT GT GGT GAAT GAC AAG AT T GT GGC CAGC AACC T CGT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACC T GC GAGT T TCC ACGCC T GT AC ACC AT TT CT GAAGGAT AC GT TCC C AACC T TC GAAAC T CCC C AC T GGAAAT 
CAT GAGC CGAAATC ATGGGATCT TCCCATTCACTCT GGAGATCT TCAAGGAC AATGAGTTT GAAGAGCCTTACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTT GGT GGAGAGCT GCTTT GCCACCCCC ACCTCCAAGAT CGACGAGGT CCT GAAAT ACTACCTC AT 
CCGGGAT GGCT GTGT TTCAGATGACTCGGTAAAGCAGT ACACATCCC GGGATCACC T AGC AAAGCACT TCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGT GGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGT CCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAAC AGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT C AGAAGC T G GGT AT AAT AT T TC AAGT T AC AAACCCT AGAAAAAT T AAACAGT TACT GAAATT AT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TT AT AGG GAAT T T GGAAGT GTAT CAAT AAAAC AGT AT AT AAT T T T 



m 
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FIGURE no 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVWDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEI FKDNE FEE PY REAL PTLKLRDSLY FGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites, 
amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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o 

: - • 



GAGAGAGGCAGC AGCT TGCTC AGCGGAC AAGGAT GC TGGGCGTGAGGG AC CAAGGC CTGC CCTGC ACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCT GCCT TGGGGTGACAAT CT CAGCTC CAGGCT ACAGGG AG ACC GGGAGGATC ACAGAGC CAGC ATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GG AAGC AGCT GT GTGACGGAGAGCTGGACT GT CC CT TGGGGGAGGACGAGGAGC ACT GTGTC AAGAGC TTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GCTACAGC AGAGCTGT GG AGAT TG GCCC AG ACCAGG AT CTGG ATGTT G TT GAAATC AC AGAAAACAGC CAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 



1 1| T C AGCATCCAGT ACGACAAAC AGC ACGTCT GT GGAGGGAGC ATCCT GG AC CCCC ACT G GGTCCTC ACGGC A 

GCCC ACTGCT TC AGGAAAC ATACCGATGT GTT CAAC TGGAAGGT GCGG GC AGGC TC AGAC AAACTGGGCAG 

J?* CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 

3? TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 

U GATG AGGAGCTC ACTC CAGCCACC CC ACT C TGGAT C AT TGGAT GGGGCTT TACGAAGC AGAAT GGAGGGAA 

^ J GATGTCTGACATACTGCTGCAGGCGTCAGTCCAGGTCATTGACAGCACACGGTGCAATGCAGACGATGCGT 

J K * ACC AGGGGGAAGTCAC CGAGAAGATGAT GT GTGC AGGC AT CCCGGAAGGGGGTGTGGACACCTGCC AG GGT 

5lf GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 

c:i 

jr B CTGCGGGGGC CC GAGCACCCCAGGAGT AT ACACC AAGGT C TC AGCC TAT C TC AACT GG AT CT ACAATGTCT 

I II 

GGAAGGCTGAGCTG TAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCC C AAAGT CAGACACAGAGC AAGAGT CCCCTT GGGT AC ACCC CT CT GC CCAC AGCCTC AGCAT 
T TCT TGGAGC AGCAAAGGGCC T CAAT TCCTGT AAGAGACCCT CGCAGC CC AGAGGC GCCC AGAGGAAGTCA 
GC AGCCCT AGCT CGGCC AC ACT TGGTGCT CCC AGCATCCC AGGG AG AGAC AC AGCC CACT GAAC AAGGT CT 
CAGGGGTAT T GC TAAGCC AAGAAGGAACTT TCCC AC ACT ACT GAATGGAAGC AGGC TGT C TT GTAAAAGCC 
CAGATC ACT GTG GGCT GGAG AGGAGAAGGAAAGGGT CT GCGC CAGCCCTGTCCGTCTTCACC CATC CC CAA 
GC CT AC T AGAGC AAGAAACC AGTT GT AATAT AAAAT GC AC TG CCCT ACTGTTGGTATG ACT ACCGT T ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 

MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSIAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

3" ~ 

Q Transmembrane domain: 

W amino acids 32-53 (typell) 



o 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
JU CTTGCCCTGGCCGTAGAAGGGAT TGA CAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
lit AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 

m GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 

^ 

|, | ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
tjf CAGAAGGGGTC T GC T TAGACCACCT GGT T T AT GT GACAGGAC T T GCAT TCTCCT GGAACATGAGG 

V3 GAACGCCGGAGGAAAGCAAAGT GGCAGGGAAGGAAC T T GT GCCAAAT TAT GGGT CAGAAAAGAT G 

GAGG TGTTGGGT TAT CACAAGG C AT C GAG TCTCCT GCAT T CAG T GGACAT G T GGGGGAAGG GC T G 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
I n CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 

*8? 'i. 

Q TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
Pf CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
% GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 

GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAAT GATC TCCAAGGGCCCT TATACCCCAGGAGAC T TTGAT T T GAAT T T GAAACC CCAAAT CCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTAC T CGGGAGGCT GAGACAGGAGAAT TAC TTGAACCTGGGAGG TGAAGGAGGC TGAGACA 
GGAGAATCACT TCAGCC TGAGCAACAC AGCGAGACTCT GT CTCAGAAAAAAT AAAAAAAGAAT TA 
TGGTTATTTGTAA 
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FIGURE ha 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 



u 

Q 
CIS 

m 
iy 
ui 
%a 

O 
If! 
Hi 
W 

ru 
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FIGURE 115 



CAGCAGT GGTC TCTCAGTCC TC TCAAAGCAAGGAAAGAGTACT GT GT GCTGAGAGACCATGGCAA 
AGAAT CCT CCAGAGAATT GT GAAGACT GT CACAT TC TAAAT GCAGAAGCT T T TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTT T T GGGGGAGCAAGCAC T TCT GGCCGGAGGTACCCAAAAAAGC CTATGACAT GGAGCACACT T 
T CTACAGCAATGGAGAGAAGAAGAAGATT TACATGGAAATTGATCCT GTGACCAGAACTGAAATA 
TTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
few CTGAACCAGAAGAGGAAATAGATGAGAAT GAAGAAAT TACCACAACT TTCTT TGAACAGTCAGTG 

O ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 

GAT T T GT GATAACGTGACCATGT AT TGGATCAAT CCCAC TC TAATAT CAGT T TC T GAGT TACAAG 

m 

•5 3 ACT T T GAGGAGGAGGGAGAAGAT C T TCAC T T T CCTGCCAAC GAAAAAAAAGGGAT TGAACAAAAT 

m 

Iff GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 

*fj AGAACTTCCAATAAATGAC TATACT GAAAATGGAATAGAAT TTGATCCCATGCTGGATGAGAGAG 

4* GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 

J,,. GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 

Ijj. TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 

f3 TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 

It} GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 

W T TAT CACCAAAAA&AAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGIIALTLIVLF^GSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSEPEEE I DENEE I TTT FFEQSVI WVPAEKPIENRDFLKNSKI LE I CDNVTMYWINPTL I S VSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCC I YCRRGNRYCRRVCEPLLG YYPYPYC YQGGRVI CRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-4 0 

Transmembrane domain : 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C- terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme -binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
M" GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
O TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 

GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
* f I T GGGT C CAAC AC C AAAAAC AAGAAGAT ATACGAT GG AGGT GCCCGC AC AG AGGAC GAGGT AC AATCT T AT C 

"ft CTTCCAAGCACGACTATGT GTAA TGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
,f| CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
jj* CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 

GC TGAGTT ATTT ATGAAT T AGAGGCT AT AG CTC ACATTTT CAATCCTCT ATT TC T TTT TT T AAAT AT AAC T 
TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
!Jj TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
W GAGT AGAC CC AAAG ATGT T ATT TT CTGCTGTTT GAAT TTT GT CTCC CCACCCCC AACT T GGC TAGT AATAA 

|W ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 
J£? ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 

TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
C GCGGGTC AG AAAT T GTC CCTAGAT GAATGAGAAAATT AT TT TTT T TAATTT AAGTCCT AAATATAGT TAA 
AATAAATAAT GT TTT AGT AAAATGAT ACAC T ATCTC TGTGAAAT AG CCT C ACCC CT AC ATGT GGAT AG AAG 
GAAATGAAAAAATAATT GCT TT GAC ATT GT CT AT AT GGT ACTT TGTAAAGTCAT GCT T AAGT AC AAAT TCC 
ATGAAAAGCT CACACCTGTAATCCTAGC ACTTT GGGAGGC TG AGGAGGAAGGAT CACT TGAGCCCAGAAGT 
T C GAGACT AGCCTGGGCAAC AT GGAGAAGC CCT GTC TCT ACAAAAT AC AGAG AGAAAAAATC AGCC AGTC A 
TGGT GGCAT ACACCTGT AGTCCCAGCAT TC CGGGAGGCT GAGGTGGGAGG AT CACT TG AGCCCAGGGAGGT 
T GGGGCTGCAGT GAGCC ATGAT CACACC AC TGCACTCCAGCC AGGT GAC ATAGC GAGAT C CT GTCT AAAAA 
AATAAAAAAT AAAT AATGGAACACAGC AAGT CC TAGGAAGTAGGTT AAAACTAATT CT TTAA 



APP_ID=10063594 Page 269 of 320 



FIGURE 118 



MS TT TCQWAFLLS I LGLAGC I AATGMDMWS TQDL YDNPVTS VFQYEGLWRS CVRQS SGFTECRP 
YFTILGLPAMLQAVRALMIVGIVLGAIGLLVSIF7VLKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAIAGVSVFANMLVTNFWMSTANMYT 

CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100," 121-141, 173-194 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CAT CAGGATGCAGTGCAAAAT CTATGATTCCCT GCT GGCTCTT TCTC CGGAC CTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
jU CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
13 ACT CAATAGTGAATGTT GCC CAAAAAC GT GAGCTTGGAGAAGCTC TCTACTTAGGAT GGACCACGGCA 

*** CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 

m 

r ;; CAGATACT CGAT ACCTT CCCATC GCACAACCCAAAAAAGTTAT CACACCGGAAAGAAGTCACCGAGCG 

p5 TCTACTCCAGAAGTCAGTATGTG TAG TTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 

,n ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 

Vs:? 

£ AATCT TAATTACAGGAACTGT GCATCAGCTAT T TAT GAT TCTATAAGCTAT TTCAGCAGAATGAGATA 

5 T TAAAC C CAAT GC T T T GAT T GTT C T AGAAAGT ATAGT AAT T T GT T T T CTAAG G T GGT T CAAGC AT C T A 

U CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 

s f 5 ACGACAT AG CAT TAT GT ACAT AGAT GAG T GTAACAT T TAT AT C T CAC ATAGAGACAT GC T TAT AT GGT 

■£*'% 

T\ T TTATTTAAAAT GAAAT GCCAGTCCAT TACACT GAATAAATAGAACTCAACTATT GCTTT TCAGGGAA 

III 

f~% ATC AT GGAT AGGG T T GAAGAAGG T T AC TAT T AAT T GT TT AAAAACAGC T TAGG GAT T AAT GT C C T C CA 

JfJ TTTATAATGAAGATTAAAAT GAAGGCT TTAAT CAGCATT GTAAAGGAAATT GAAT GGCTTTCT GATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
T TC CAGGG C TAT ACT C AGAAGAAAGAT AAAAGT GT GATC T AAGAAAAAG T GAT GGT T TTAGGAAAG T G 
AAAATATT T TTGT TTTT GTAT TT GAAGAAGAAT GAT GCAT TTT GACAAGAAAT CATATATGTATGGAT 
AT AT T T TAATAAGTAT T T GAGTACAGACT T TGAGGT T TCAT CAAT ATAAATAAAAGAGC AGAAAAAT A 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACT GACAAT CCAATT TGAAAGTT T GTGTCGACGTCT GTCTAGCTTAAAT GAAT GT GT TCTATT TGCTT 
TATACAT TTATAT TAATAAATTGTACATT TTTCTAAT T 
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FIGURE 120 

MATHALE IAGL FLGGVGMVG WAVTVMPQWRVSAFIENNI W 

IYDSLLALSPDLQAARGLMCAASVMS FIAFMMAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG 
MVVLIPVSWVANAI IRDFYNS IVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 
amino acids 82-101, 118-145, 164-188 



1 y 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATT CAAAGGAGAAAAGGGGGAATG TC TGAGGGAAAGC TT T GAGGAGT CC T GGACACCCAAC TAC 
H AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
l;( TACAAAGAT GCGT T CAAATAGT GCT C T AAGAGT T TT GT TCAGT GGCT CAC T T CGGC TAAAATGCA 

m GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
fjj ATTGAAGCTATAATT TATT TGGACCAAGGAAGCCCTGAAATGAAT TCAACAATTAATAT TCATCG 

Ifl CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
**J T T GGCAC T T GT TCAGAT TAC CCAAAAGGAGATGC TT C TAC T GGAT GGAAT T CAGT T TC T CGCATC 

ATTATTGAAGAACTACCAAA ATAAA TGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 
£•* T T GGAAT GG TTCAC T TAAATGACAT T T TAAATAAGT T TATGTATACAT CT GAAT GAAAAGCAAAG 

||! CT AAATAT G TT TACAGACCAAAG T G T GAT T TCACAC TGTT T TTAAAT C TAGCAT TAT TCAT T TT G 

O CT T CAAT CAAAAGT GGTT TCAATAT T T TT T TTAGT TGGT TAGAATAC T T TC T TCATAG TCACAT T 

%U CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 

!!!'! AAAAAATATAAAAGC TAC CAAT C T T T G T ACAAT T T G TAAAT GT T AAGAAT t T T T T T TAT AT C T G T 

v \ i 

T AAATAAAAAT TAT T TCCAACA 



APPJD=10063594 



Page 273 of 320 



FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GCCT GGAAGAATACATCAT GT T T T TCGATAAGAAGAAATT GTAGGAT CCAGT T TTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TAT TACC TAGGAAGAT TT TGAT GT T T TGC T GCGAAT GCGGT GT T GGGAT T TAT T TGT T C T TGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGGATGGGTT TCAAT G 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCGCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
I ; ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
IZ ATAACAGCC T TCAAAAAC T TAAGTATAAT CAAT T TAAAGGGCTCAACCAGCT CACCTGGCTATAC 

%f CTTGACCATAACCATATCAGCAATATTGACGAAAATGC TT TTAAT GGAATACGCAGACTCAAAGA 

O GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
ffl TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
ill CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
X£& AGAC T GCCGCAACC T GGAAC TT T TGGACCT GGGATATAACCGGAT CCGAAGT TTAGCCAGGAAT G 

**J TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC 
\fj CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 

CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
^ TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
jf s TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
111 T TAAAGGT CTAAGGGAGAAT ACAAT TATC T GTGC CAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 

Qj ATCGATGCAGT GAAGAAC TACAGCAT C TGT GGCAAAAGTAC TACAGAGAGG T T T GAT CTGGCCAG 

hi GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
JZ TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
Jj| TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
fW CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 

GGCACAGGAAAAAGAAAAGACAGTCCCT AAAGCAAAT GACT CCCAGCACCCAGGAAT T T TATGT A 
GAT TATAAACCCACCAACACGGAGACCAGCGAGATGC T GCT GAAT GGGACGGGACCC TGCACC TA 
TAACAAAT CGGGC TCCAGGGAGT GT GAGG TATGAACCAT T GTGAT AAAAAGAGC TC T TAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATTTTCCTCTCATACATAATCAACCCATTGAAATTTAAATACCACAATCAATGTGAAGCTT 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
GTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAAAA 
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FIGURE 124 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARWFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENTI ICASPKELQGVNVIDAVKNYS ICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKI IAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQE FYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGT TATCGTCT TGCGCTAC TGCTGA ATGT CCGTCCCGGAGGAGGAGGAGAGGCT TT TGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T GAAGAG GAAGGC T T T C T AAAG C T T T GG CAAG GAG T GACAC C C G C CAT T T AC AGAC ACG TAG T 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
H T TAGCCAAT CCAAC TGACC TAGT GAAGGT T CAGATGCAAAT GGAAGGAAAAAGGAAAC TGGAAGG 

AAAAC CAT T GC GAT TTCGTGGT G T AC AT CAT G CAT T T G CAAAAAT CT TAG C T G AAGG AGGAATAC 
Jjt GAGG GCT T T GGGCAGGC T GGGT AC CCAAT ATACAAkGAGCAGC AC T GGT GAATATGGGAGAT T T A 

1 1 1 ACCACT TAT GATACAGTGAAACACTAC T T GGTATTGAATACACCACT T GAGGACAATATCATGAC 

HI TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
%l TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCG 

AC TGACT GC T T GAT TCAGGC TGTTCAAGGT GAAGGAT TCATGAGT CT ATATAAAGGC T T TT TACC 
i n ATCT TGGC T GAGAATGACCCCT T GG T CAAT GGTGTTCTGGC TT AC TT AT GAAAAAATCAGAGAGA 

fjf TGAGTGGAGTCAGTCCATTTTAA 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWTSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VI GGMMAGVT GQFIANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKILAEGGIRGLWAGWVP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 



Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 



111 



3 . !■ 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAAT TCGCC T T CAAACT CCAC GACAAGGACC C T GAGAAGGCAGC TGCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGAT TGGAGACATCCAGATGAGGCC CACGGAGCAGG T GAC CTAG TGAC TGT GGGAGC TCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACT T C CTG TCCACACCCCGACCAGGGGCTAGAAAAT T T GT T TGAGAT T T T TATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGT GGAT T T CAT GGT GATCAT TAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGAS GGI GAAVAEIAIjVQQGLKVVGCARTVGNIEE LAAECKSAG YPGTLI 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGW 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase. 

amino acids 30-42, 104-114 
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FIGURE 12Q 



7 - 

III 

O 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAA ATGAG GAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T T AAT T T T TAAC TGATAG T TGTACATATT TGGGGGTACATGTGAT AT T T GGATACAT GTATACAA 
TATATAATGATCAAATCAGGGTAACTGGGATATCCATCACATCAAACATTTATTTTTTATTCTTT 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTGTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T T C CAT C CAT G T T G C T GC AAAT G AC AG GAT TTCGTTCT T AAT T T C AAT T AAAAT AAC C AC AC AT G 
GCAAAAA 
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FIGURE 13Q 

MGLLLLVLFLSLLPVAYT IMSLPPS FDCGPFRCRVSVAREHLPSRGSLLRGPRPRI PVLVSCQPV 
KGHGTLGES PMPFKRVFCQDGNVRS FCVCAVHFS SHQPPVAYECLK 



Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids .86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 

T TCT GAAGT AAC GGAAGCT ACCTT GT AT AAAG AC CT C AAC ACTGCT GACC ATGATC AGCGCAGCCT GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT ACCAGAGG AT GC T ACAACTCTCT ACC T TCAGAAC AACCAAAT AAAT AAT GCT GG GATTCCTT CAGAT 
T TGAAAAACT TG CTGAAAGT AGAAAG AAT AT ACCT AT ACC ACAAC AGT TT AG ATGAATTT CCT ACCAACCT 
CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA 
AAAT TCCCT ATC TGGAAGAATT AC ATT T AGAT GAC AACT C TGT CT C TGC AGTT AGC AT AGAAGAGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GCCCAGGACT AT AG AAGAACTACGCT T GG ATGAT AATC GC AT AT CC ACT ATTT C AT CACC AT CTCT TC AAG 
2? GTCT CACT AGTCT AAAAC GCCT GGT TC T AGAT GGAAAC CT GTTGAACAAT C AT GGTT T AGGT GAC AAAGTT 

ill TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
I s 5 T CCAGGCAC AAACCT GAG GAAGCT T T ATCT TCAAGATAACCACATC AATC GGGT GCCCCC AAATGCTT TTT 

Ifl CTT ATCT AAGGCAGCT CTATCG ACT GGATATGTCCAAT AAT AAC CTAAGT AAT T TACCTC AG GGT ATC T T T 

l|| GATGAT TTGGAC AAT ATAAC AC AACT GATT CT TCGC AAC AATCC CT GGTATTGC GGGTGC AAGATG AAAT G 

GGT ACGTGAC TGGTT ACAAT CACT AC CT GT GAAGGT C AAC GT GC GTGGGCT C AT GT GCCAAGCCCC AG AAA 
- AGGTTC GTGGGATGGC TATT AAGGATCT CAAT GC AG AACT GT T T GATTGT AAGG ACAGTGGG ATTGT AAGC 

jsj? ACCATT CAGATAACCACTGC AAT AC C CAAC AC AGT GTATC CT GC CCAAGG AC AG TGGCCAGCT CCAGT GAC 

^| CAAACAGCC AGAT AT T AAGAACC C C AAGCT C ACT AAGGAT CAAC AAACCACAGG GAGTCC CT CAAGAAAAA 

CAAT TAC AATTACTGT GAAGTCTG TCACCT CTGATACCAT TCAT ATCT CT TGGAAACTTGCTCT ACCT AT G 

111 

£*4 ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 

£5* AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 

H.I 

CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CGAATGTACAAC CCT ACAAC C ACCCT CAAT CG AGAGCAAGAG AAAGAACCTT AC AAAAAC CCCAATTT AC C 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTC AT AGGAATGGATCGCT C TT CT C AAGGAAC TGT GCAT ATAGC AAAGGGAGGAGAAGAAAGGAT GACT AT 
GC AGAAGCTGGC ACT AAGAAGGAC AACT CT AT CCTGGAAATCAGGGAAACTTCT TT T CAGAT GTT ACC AAT 
AAGCAATGAAC CCATC TCGAAGGAGGAGT T TGTAAT AC AC ACCAT ATTTC CTCCTAAT GGAATGAATC TGT 
AC AAAAAC AATCAC AGT GAAAGCAGT AG T AACCGAAGC T ACAGAGACAGT GGT ATT CCAGACTC AGAT C AC 
TC ACACTC ATGATGCT GAAGGACT C ACAGCAGACTT GT GTTT TGGGTT TT T T AAACCT AAGGGAGGTGAT G 
GT 
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FIGURE 132 



m 



MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVS I EEGAFRDSNYLRLL FLSRNHLS T I PWGLPRT IEELRLDDNRI S T I S S PSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFS YLRQLYRLDMSNNNLSNLPQGI FDDLDNI TQL I LRNNPWYCGCKMKWVRDWLQS LPV 
KVNVRGLMCQAPEKTOGMAI KDLNAE L FDCKDS GIVSTIQITTAI PNT VYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKT I T I TVKSVT SDT IHI S WKLALPMTALRLSWLKLGHSPAFGS I TET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAIIGGAVALVTIALLAL^ 

ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 



|f| amino acids 531-552 



N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTC ATG TCTGCCRGGGGGAGGTGGGAaGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACTCCAACAGAGCCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCT TCGGGGAGCCTGAT GC TGAAGATGAAGAAT TATC TAAAGC TAT TCAA 
TAT CAGCAGCAT TT T TCGAGGAGAGTGAAGAGGCGAGAAAAACAAT T T CCAGAT TCTCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATT ATAG GTGTGAGCCACCGTGTCTG 
GCT GAAAAGCAC TT T CAAAGAGACT GT GT T GAATAAAGGGCCAAGGT T C T T GCCACCCAGCAC TC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGdCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE iaj. 

MSARGRWEGGGRRACRGSLGLARAQGAERVTS SEQRPAMASLGLLLLLLLTALPPLWSS SLPGLD 
TAESKAT I ADL I LS ALERATVFLEQRLPE I NLDGMVGVRVLEEQLKS VREKWAQE PLLQPLS LRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites . 

amino acids 10-16, 20-26, 63-69, 208-214 

Ami da ti on site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 



APPJD=10063594 



Page 286 



FIGURE 135 

GGTCTGAGTGCAGAGCTGCTGTC ATG GCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCT TACAGATT TGATCCCGTT C GAG T GGATAT C AC T T C GAAAG GAAAAAT G AGAGC AAGAT AT G 
T GAATTACATCAAAACATCAGAGGT T G T CAGAC T GCCC TATCCT CT CCAAAT GAAATC T TCAGGT 
hk C C AC C T T C T TAG T T TAT TAAAAG GGAAT CGTGGGGC T GGACAGAC T T T CT AAT GAACC CAAT GG T 

O TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
?f CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGC TGAAT TCCAACCATGAGTTGCCTGAT 

jfjjjj G T T T C T GAG T T CAT GACAAGAC TCTTCTCTT C AAAAT CAT C T GG C AAAT C TAG C AGC GG CAGC AG 

Iff TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGG TAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
U.1 AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
4* GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

* AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 

GAGGTCAATATTGATGTCACTGAATTAATTACAGTGTCCTATAGAAAATGCCAT TAATAAAT TAT 
£3 ATGAACTAC T ATACAT TAT GTATAT TAATTAAAACATC T TAAT C CAGAAAT CAAAAAAAAAAAAA 

ill AAAAAAAAAAAAAAA 

pi 3 
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FIGURE 126 

MAAALWGFFPVLLLLLLS GDVQS SE VPGAAAEGS GGSGVG I GDRFKI EGRAWPGVKPQDW I SAA 
RVLVDGEEHVGFLKTDGS FWHD IPS GSYWEWS PAYRFDPVRVDI T SKGKMRARYVNYI KTSE 
VTOLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMMMVLPLLI^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 



Important features of the protein: 
Signal sequence: 

amino acids 1-23 



Transmembrane domain: 

amino acids 161-182 



\U N-glycosylation site. 

r 

*** amino acids 184-187 



||| Glycosaminoglycan attachment sites. 

Q amino acids 37-40, 236-239 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 



Amidation site. 

amino acids 238-241 



ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 15*7 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAAC T CGCACTCAGAC TCTGAACT CAGACC TGAAATC T TCT CT TCACGGGAGGC T TGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGC T GT GTGATCGCCACAAACC TT CAGGAAATACGAAATGGAT T T T C TGAGATACGGGGCAGTG 
TGCAAGCCAAAGAT GGAAACAT T GACATCAGAATC T TAAGGAGGACT GAGT C T TT GCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACGATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGATTC TGAGTCACTT TGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 
T TT GGGGGAAC TAGACAT TCT T C TGCAAT GGAT GGAGGAGACAGAATAGGAGGAAAGT GAT GCTG 
CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 
ATGTATT TATT TTTT TAC TTGGACATGAAACT TTAAAAAAATTCACAGATTATAT TTATAACCTG 
ACTAGAGCAGGT GATGTAT T TT TATACAGT AAAAAAAAAAAACC TT GTAAAT TC TAGAAGAGTGG 
CTAGGGGGGT TAT TCATTTGTATTCAACTAAGGACATATT TACT CATGCTGATGCTCTGTGAGAT 
AT T T GAAAT T GAACCAATGAC TACT TAGGATGGGT TGTGGAATAAGT T TT GATGTGGAAT TGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATT TCTACAAATAAAGT T TTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSE I RGS VQAKDGN I D I R I LRRTES LQDTKPANRCCLLRHLLRL YLDRVFKNYQT PDH YTLRKI S 
S LANS FLT IKKDLRL S HAHMTC^ 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCT CT CT TCAGTAAAGTT GT T AT TGT T C T GATAGATGCCT T GAGAGAT GAT TTTGTGTTTG 
GGTCAAAGGGTGTGAAATT TATGCCCTACACAAC TTACCT TGTGGAAAAAGGAGCATC TCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGC TGGAAAAAGAATAGTCT T T TATGGAGAT GAAACC T GGGT TAAAT T A 
T T CCCAAAGCAT T T T GTGGAATATGAT GGAACAACC TCAT T T TT CGT GT CAGATTACACAGAGG T 
GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
Q AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 
O GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
fi\ GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
Y[\ CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
;J ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
ill CAATGAGAGAGCAGTTGAGATTTTTACATTTGAATACAGTGCAGCTTAGTAAACTGTTGCAAGAG 
SSI AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
Jf GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 

TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
* TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CI CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
l|| CATTGTGTGCACCTCAGGTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
f*) GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
H\ AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
J; TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
O CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
fll T CGGACAGC CTCCCAGCAGAGG T G T GGGAGCTGCAGCT GAGGGAAGAAGAGACAAT CGGCCTGGA 

CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACGCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACC TT CATT CGT TAGCT GGGGAGT GGTGGTGAGGCAGT GAAGAAGAGGCGGATGGT CAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C T GACCGAGACAC TCACAGC TT T GT CATCAGGGCACAGGC T TCCT CGGAGCCAGGATGATC T GT G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CT GCACACAGTATGTAGT TACCAAAAGAATAAACGGCAATAAT TGAGAAAAAAAA 
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FIGURE 1AO 

MRLGS GT FATCC VAI E VLGI AVFLRGFFPAPVRS SARAEHGAE PPAPEPS AGAS SNWT TL PPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVI RNLNS PALLEDS VTRQAKAAGKRI VFYGDE TWVKL FPKH FVE YDGT TS FFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites . 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
T CCC TATAGAAAACAACTGCCAGCACCTT AAGACCAC TCACACC TT CAGAGT GAAGAAC T TAAAC 
C CGAAGAAAT TCAGCATTCATGACCAGGATCACAAAGTAC TGG T CCT GGAC T CT GGGAA.TC TCAT 
AGCAGT TCCAGATAAAAAC TACATACGCC CAGAGATC T TC T T TGCAT TAGC CTCAT CC T TGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCAT CCAT CC CT TCAGC TGAAGAAGGAGAAAC TGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GT TGGGG T GACAGATAAAT T TGAGAACAGGAAACACAT TGAAT T T T CAT TT CAACCAGT T T GCAA 
AGCT GAAATGAGCCCCAGTGAGGT CAGCGAT TAG GAAACT GCCC CAT T GAACGCC TTCC TCGC TA 
AT T T GAACTAAT TGTATAAAAACACCAAACC TGC TCAC T 
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FIGURE 142 

MLLLLLE YNFP I ENNCQHLKTTHT FRVKNLNPKKFS I HDQDHKVLVLDSGNL I AVPDKNY IRPEI 
FFAIASSLSSASAEKGSPILLGVSKGSFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCPCAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site* 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 



APP ID=10063594 



Page 294 



FIGURE 143 

C TAGAGAGTATAGGGCAGAAGGAT GGCAGAT GAGT GACTCCACAT CCAGAGCTGC CTCCCT T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGT GGAG T TT GC TGTCCACACAT TCAACCAACAGAGCAAGGAC TAC TAT GCCTACAGAC T GjGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
GGGGAGAAC TAGG TGTGGGAAAT T TGAAGACGACAT T GACAACT GCCAT T T CCAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCAC TGAG TGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
AC AC AT C T T GAGC C TAAT CAT G TAG T G T AGATCAT TAAACAT CAG CAT T T T AAGAAAAAAAAAAA 
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FIGURE iaa 

MLGL PWKGGLS WALLLLLLGSQ I LLI YAWHFHEQRDC DEHNVMARYLPAT VE FAVHT FNQQS KDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 



Q N-glycosylation sites. 

W amino acids 117-121, 139-143 

W 

ill N-myristoylation site. 

%ll amino acids 9-15 
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FIGURE 145 



C TGT GCAGC TCGAGGC TCCAGAGGCACACT CCAGAGAGAGCCAAGGT T CT GACGCG ATGA GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGT GGAACCGGAAGGCCCTGCCCAGCAC TGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACTUVCAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
\*$ TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
|y CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
Iff TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
^ CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
^ GAGAAGAACACATCAGGCACTGCGCCACC T GC T TCACAGTACT TCCCAACAACT CT TAGAGGTAG 

GT GTAT TCCCGT T T TACAGATAAGGAAACT GAGGCCCAGAGAGC T GAAGT AC TGCACCCAGCAT C 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CTC TAAT GAAAT TGTGAAAGC T CCAT GT T T AGAAATAAAT GAAAACACC T GA 
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FIGURE 146 

MRKHLSWWWIATVCMLLFSHLSAVQTR 

LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-gly cosyla ti on si tes . 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites . 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTC ATG TTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGT.CE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGA ATG GGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCC AAGAC AAAT C T C TACAT AATAGAGAATT AAGT G CAGAAAGACC T T T GAAT GAACAGAT T GCT GAA 
GCAGAAGAAGACAAGAT TAAAA?VAACATATC CT CCAGAAAACAAGCCAGGTCAGAGCAAC TAT TCTTT 
TGT T GAT AACT T GAAC C T GC T AAAGGCAATAAC AGAAAAG GAAAAAAT T GAGAAAGAAAGACAAT CT A 
Q TAAGAAGCTCCCCACTTGATAATAAGTTGAA.TGTGGAAGATGTTGATTCAACC^AGAATCGAAAA.CTG 
Q ATCGATGATTAT GACT CTAC TAAGAGT GGATT GGAT CATAAATTT CAAGATGATCCAGAT GGT CT TCA 

y I TCAACTAGACGGGACTCCTT TAACCGCTGAAGACAT T GTCCATAAAAT CGCT GCCAGGAT T TATGAAS 

f£ AAAATGACAGAGC CGTGTTT GACAAGATT GTTTCTAAACTACTTAATCT CGGCCT TATCACAGAAAGC 

"tf CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTT TTACAAAAATTAATCT CAAAGGAAGCCAACAA 

p TTATGAGGAGGATCCCAATAAGCC CACAAGC T GGACTGAGAATCAGGCT GGAAAAATACCAGAGAAAG 

9 TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
Q T TAACC T T GACAAAT GG CT T GGAAAGGAGAAC T AAAACC TACAGT GAAGACAACT T T GAG GAAC T CCA 

HI ATATT TCCCAAATT TCT AT GCGC TACT GAAAA.GTAT TGATT CAGAAAAAG7\AGCAAAAGAGAAAGAAA. 

W CACTGATTACTATCATGAAAACACTGATTGACTTTGTGAAGATGATGGTGAAATATGGAACAATATCT 
J! CCAGAAGAAGGTGTTTCCTACCTTGA^AACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 

Sri AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 

HI 

CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATAT GGAAGCT TGAAGGATTCCACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CAT CAGAAAAAATAT TGAAT GGT T GAAGAAACATGACAAAAAGGGAAATAAA.GAAGATTATGACCT TT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGT TTCAGAAAACATAATATAGCTTAAAACACTT CTAAT TCTGT GAT TAAAATT TTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
Q S NY S FVDNLNL LKA I TE KE K I E KE RQ S I RS S P L DNKLNVE DVD S TKNRKL I DD YD S T KS GL DHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NS NP G GKT DE P KG K T E AYL E AI RKN I E WLKKH DKKGNKE D YDL S KMRD F I NK QADAYVEKG I LDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71/ 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACGGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGT GGAGGAGACCCATGGCGGACAAT CAC T CT C TC T GCTC TCAGGACCCCCACGT CTGAC T TAG 
T GGGCACC T GACCAC T T TGTCTT CTGG TT CCCAG T T T GGATAAAT T C T GAGATT T GGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAAT GGC T CGAGC TCAGAAGATAAAAGATAAGTAGGGTAT GC T GAT CCT CT TT TAAAAAC CCAA 
GATACAATCAAAATCC CAGATGCT GGT CT CT AT T CC CATGAAAAAG T GCTCATGACATATTGAGA 
AGACCTACTTACAAAGTGGCATATATTGCAATT TATT TTAATTAAAAGATACCTAT TTATATATT 
T C T T TATAGAAAAAAG TC TGGAAGAGT TTAC TT CAAT TGTAGCAATGT CAGGGT GGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
AT TCC TT GT ATAAAAATAAGAAAAGAAAT T AATC TTGAGGTAAGCAGAGCAGACATCATCT C T GA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
T TGTAGT AGTGATCAGGAAACAGATC TCAGCAAAGC CACTGAGGAGGAGGC T GTGC TGAGT T TGT 
GTGGCTGGAATCTCTGGGTAAGGAACTT7\AAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCT7^AGAAGGCTGCTGTACTGGTTGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
GACT GGT T T CC T TGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TAT GTA 
AAGATGAAGGCAGAGATCGGAGT TTTGCAGCCACAAGCTAAGAAACACCAAGGAT TGTGGCAACC 
AT CAGAAGC T TGGAAGAGGCAAAGAAGAAT TC T TCCC TAGAGGC T T TAGAGGGATAACGGC TC T G 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAAT T GG T TACAGCAGC T C TAGGAAAC TAATACAGC T GCTAAAAT GATCCC TGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATAT GGCAGAAGTGATGGCAT GCCAC T TC CAAGAT TAGG T T ATAAAAGACAC TGCAGC TT C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACAT TAGT GAACCTAGAAGCAGAGAC T CTGTGAGATAAT CGATGT T TGTTGTTTT 
AAGT T GC TCAGT TT T GGT C TAAC T T GT TAT GCAGCAATAGATAAATAAT AT GCAGAGAAAGAG 
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FIGURE 1S2 

MVLSGALC FRMKDS ALKVL YLHNNQLLAGGLHAGKVI KGEE I S WPNRWLDAS LS PVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
T T CCAGCAGCCCTATATCACCAACCGCACC T TCATGC T GGC TAAGGAGGCT AGCT TGGC TGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT T G AAG G T GAT GAC C T G CAT AT C C AGAG G AAT G T G C AAAAG C T G AAG G AC AC AG T GAAAAAGC 
T TGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGAT TTGCTGT TTAT GTCTCTGAGAAAT 
GCCTGCATT TGA CCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAATTAGATGCCCCAAAGCGAT TT TT TTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGAC T T TC TAAGCATAGATAT T TAT T GATAACAT T T CAT T GTAACTGGTG T TC 
T ATACACAGAAAACAAT T TATT T T TTAAAT AAT TGTCTTTTT CCATAAAAAAGAT TAC T T TCCAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
T G TAT T TAT TAT TAT TATAAGACT GCAT T T TAT T TATATCATT T TAT T AATATGGAT T TAT T TAT 
AG AAAC AT CAT T C GAT AT T GC TAC T T GAG T G T AAGG C T AAT AT T GAT AT T TAT GAC AAT AAT TAT 
AGAGC T ATAACATGT T TAT T TGACCT CAATAAACACT TGGATAT CCC 
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FIGURE 154 

MAALQKSVS S FLMGTLAT S CLLLLALLVQGGAAAP I S SHCRLDKSNFQQPY I TNRT FMLAKEAS L 
ADNNTDVRLIGEKLFHGVSMSERCYMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE .155 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGT CAG T GCCCGAC T TGT GACT GAGT GT GCAGTGCCCAGCAT GTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAG G GAGC GAC C CAGAT TAGG T GAG GAC AG T T C TC T CAT T AGC C TT T T C CT ACAG GT GGT T GCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
111 AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
w GTGATGGGC TAG CCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
^ ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
PI CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
%n CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 

O GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 

■iTl 

GCCCAGCACAGGCACT TTCTAGATAT TTCCCCCTTGCTGGAGAAGAAAGAGCCCC TGGT TT TAT T 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T T GACAT G GAT GAT T C T GAGGAG GAAG C T G T TAT T GAAT G TATAGAGAT T T ATC CAAATAAAT AT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 1*6 

MRERPRLGEDSSLISLFLQWAFIxAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 

CCGGC GATG TCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
C CGGAGAC T TGAGGGACC TCCGAGTAGAACCTGT TACAAC TAGT GT TGCAACAGGGGAC TAT T CA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GAT TTGTGT GACGGGCAAAAGCAACT TCCAGTCCTACAGC TGTGTGAGGT GCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCT GAACACAGT C TAT TTCAT T GGGGCC CATAATAT TC C TAATGCAAATATGAAT GAAGATGG 
Izk CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
O AGT GT GT CAAGGCCGGAAGCCT GT GGGAT CCGAACAT CAC TGC T TGTAAGAAGAATGAGGAGACA 

— GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCT CT TATCCAACACAGCAC 

m 

T A TAT CATCGGGT T TT CT CAGGT GT T T GAGCCACAC CAGAAGAAACAAAC GC GAGC T TCAGTGGTGA 

Ifl TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
U| AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
4i GGATAAGAACAAAAGC71AGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
% z> CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
?S TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
fi T CACACAAT TT GT TAC TTCAC T GAAT T T CT TCAAAACCAT TGCAGAAGTGAGGT CATCC T TGAAA 

lij AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
%rJ GCAGACA?VAGTCGTCTTCCTTCTTTCC7\ATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
ly GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

GTGATCTAAGAAGCCAGAT TCATCTGCACAAATACGTGGTGGTCTACT TTAGAGAGAT TGATACA 
AAAGACGAT TACAATGC TC TCAG TG TCTGC CCCAAG TACCACC TCAT GAAGGAT GCCAC TGC TTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 1*8 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMIWSWV 
LRADAS I RLLKATKICVTGKSN FQSY SC VRCNYTE AFQTQTRPSGGKWT FS Y IGFPVELNTVY FIGAHNI P 
NANMNE DG P SMS VN FT S PGCL DH IMKY KKKC VKAG S L W DPN I T AC KKN EETVEVNFTTTP LGNRYMAL I QH 
ST I IGFSQVFEPHQKKQTRAS WI PVTGDSEGATVQLTPY FPTCGSDCIRHKGT WLCPQTGVP FPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYIMWRHERIKKTSFSTTTLLPPIECVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWIATQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLL1KDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important, features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCAAC ATGA CAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GT CGATAT T GGGGC T T GCC TT T C TGAGTGAGGCGGCAGC T CGGAAAAT C CC CAAAGTAGGACAT A 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 

H GGTGACTGTTGGCTGGACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 

W AC T CAGCT GAAGAAG 

P 



2 ■;! 

if"! 

ru 
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FIGURE 160 



MTVKTLHGPAMVKYLLLS I LGLAFLSEAAARKI PKVGHTFFQKPESCPPVPGGSMKLDI GI INEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 



y k N-glycosylation site. 

p amino acids 83-87 

J° r * N-myristoylation sites. 

I s •! 

amino acids 106-111, 136-141 




O 
ffi 
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FIGURE 161 



ACACT GGCCAAACAAAAACGAAAGCACTCCG TGC TGGAAGTAGGAGGAGAGT CAGGAC TCCCAG G 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAG'TGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC T GAAGAT GAGGAAAAG T T T GGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
y TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
I* TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
CI GGAACTCAACCACACACAGCAGCTGCC TGCCCTGCCC TGGC T CAACG T GT CAGCAGAT GGTGACA 

O ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
ff| GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
71 CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
j!f CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
III CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
*p GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
J* TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 

AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
^ CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
y GTGGCT GTACT TCAC TACCCAGCAAAGCC TCCACGAGGGCAGCT CGCC T TGGAGAGTAC T TAC TA 

111 CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
fi: CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
1 5 « CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 

CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
CI CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
R| CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 

CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCAGGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAA ATAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSWYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKECDHAKGWLRLLKQDVRSGAAARG 
RAALLL Y S ADDS G FE RL VGALASALC QL PL RVAVDL WS RREL S AQG P VAW FH AQ RRQTLQ EGG WVLL F S P 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
m LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

W 

If! 

|lj Signal sequence: 

y| amino acids 1-20 

%»4 

Transmembrane domain. 

^ s;; amino acids 453-475 

Q 

N-glycosylation sites. 

j*| amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 

P| 334-337, 357-360, 391-394 



% II 



Glycosaminoglycan attachment site. 

amino acids 583-58 6 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 162 

GGGAGGGCTCTGTGCCAGCCCC GATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T T C T ACCAC T TAGAGC TCCAGGT CAACC GCACC TACCAAAT GCACC T T GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
Q CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
Q TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
m AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
f^l T CCACAGCGGCATAGCCTG T CCGAGAT CACCTAC T TAGGGCAGCCAGACAT CT CCATCC TCCAGC 

.^f CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
If! GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
7* CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTGCCCCACTGGGACACTTTCTAGTCCTAAA 

CACC T TAGGCC TAAAGGTCAGC T T CAGAAAGAGCCACCAGC TGGAAGC T GCAT GT TAGGT GGCCT 
* w TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
d= CCCT GGGGATT TGCACAGACAGAACAT CT GACCCAAATGT GC TACACAG TGGGGAGGAAGGGACA 

|fl CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
#4 CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
W GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
0 AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
flj ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAAT GACAGT GCAAGGAGGAAAT GCAGGGAAAC TCCCGAGG TCCAGAGCCCCAC C T C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
T T CCCAGCCAGGGCAAC TGCCT GACGT TGCACGAT TT CAGC TT CATT CCTCT GATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCC TTTTCTGCAGGCAGGAGTT TCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 164 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQAT PDS WPP S YGVCMEGS GKDS PTGTLS S PKHLRPKGQLQKE PPAGS CMLGGLSLQEVT SLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 16* 



TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATCG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGGTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGT GACGATG TAT AT GT CT ACTACAAAC T GGCAGGGGGAT CCC T TGAACT T TGGGC TGGA 
AGTGTT GAACACAGTTT TGGATAT TT TCCAAAAGATT TGATCAAGGTACTTCATAAATACACGGA 
AGAAGAGCTACATATTCCAGCAGATGAGACAGACTTTGTCTGCTTTGAAGGAGGAAGAGATGATT 
TTAATAGTTATAATGTAGAAGAGCTTTTAGGATCTTTGGAACTGGAGGACTCTGTACCTGAAGAG 
f% T CGAAGAAAGC TGAAGAAGT TT C T CAGCACAGAGAGAAAT C TCCT GAGGAGT C TCGGGGGCGT GA 

m ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
III CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 

in 

CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
s ' AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT 
O CATTACAGCAAAGGATTTCGTTGGCATCAAAATCTAAGTTTGTTTTACAAAGATTGTTTTTAGTA 

Ul CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
\r{ AAAAAAAAAAAAAAAAAAAA 

ftS 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDT FEE ILHDKLKVPGSESRTGNS SPAS VER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
CCAGAAAAGAAAGTGGCATGGAAGTATAATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACTTATTACTGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGAT 
TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
P YLTNQPGDTHQNWVTEAG 1 1 PNL I YWI PT I PLLLL I LVAFGTCC FQMLHKSKGRTKTS PNQ 
S TLW I S KS TRKE SGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-2 69 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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